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ABSTRACT 


During the 19E¢€s we will continue to see an increesed 
men OF GiStributed computer networks. Although network 
Use Hes been f£Ounmd to be effective in a wide variety of 


aAcplicatiogs, contintéed network exransion heightens the need 
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Meanphecliwe Mandagsement tc acnieve cptimum pertormance, 
pe vYeeilicy, ane security of network operations. Advances 
im network management have not kept pace with the problems 
Meat eamrse in Network operation. The Community Cn-Line 
Intelligence System (COINS; is a packet-switched network 
comme c ting Orgénizetions iva The bee Pavel sence 
Ccommmency. COINS Emhitits many of tne difficulties taced by 
lamee networks. CeaetS —  Sonlcumuaat NEtWOrkS Reve made 
edvanced tecanology available to a large uumber of users, 
yet the use of aavaenced technology to assist network 
ranegerent has been relatively Limited. This theSicm «wilt 
Sumy the CCINS Wetwork Control Center (CNCC) and explore 
weys that Voice Input/Cutzut (VIO) Technology can be used to 
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imrrove the déey-to-dey management of network operations. 
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IT. INTRODUCTION 


™ Geuvras tecanologzgical there or the 19725 was the 
coupling of two téchnclogies: computing and commune crete s: 
Dramatic reduction in the cost ot digital-tecnnology and a 
less dramatic, out still sizeable, expansion otf date 
EeetinsslON Capabilities have spurred the introduction and 
growth of Sewerds Cfeocrdphicaliy distributed computer 
metworks. Pioneereéed by a combination of government research 
ame private ioditiative, computer networks are becoming 
Peemeaotefiy aveilatie for a wide variety of applications. 

BM@enwecsemtial characteristic of a computer network (or 
more oprcperly, a computer communications network) is that 
Sewer Wiews the network as a collection of computing 
Pemerwmee> Liat previde a rarge of diverse services and 
@ecameiitics (Ref. i: p. 12128]. These computationel 
resources ae dCcecomeules to MUSErS MUnroregn a “~ReELwork 
architecture that is rore or less transparent. In most of 
today s networks, the user must establish a logical 
connection to one cf the network computers and use a set of 
host-dependent commands to access and manipulate date. 
Meueorks Of the ftture are likely to tc beccre increasingly 
Premera rent in the sense that the user will view the entire 
network {and connected networks) as a Sinzle black box that 


mereees a set of informetion processing resources that can 


tat 





be accessed by a commen commana and query language. aheure 
1 illustrates the user’s view o* a present-day computer 
communications network. 

Much of the werk in advancing network technology has 
been spearheeded by The Advanced Hesearch Projects Agency 
Wen) Of the Department of Defense (DoD). The ARPANET, an 
Beeeramente! network employing packet-switching technology, 
mes BECO Operational since 1969. This network connects over 
meme HOSt COmecnters at mcre than forty differert locations. 
THis géographical disrersicn is acconranied by corsideraole 
Meeversity in the tytes of computers thet serve es host 
memertimes. ln aed@ditror to its own oetworking activities, the 
meer technoclogy hes influenced a numboer of cther distributed 
Ree ormation processing frowects toth withio DoD and in other 
pemewrment agercies. One of these efforts was the Community 
@i Lime Intelligence System (COINS) which interccnnects on- 
Jiné information storage ena retrieval systems located at é 
mere OF leocatioas within the U.S. intelligence community. 

Ms With Weny erées of advenced technolcgy, the technical 
emeoceems OF £#COMpUTeEr networking nave been solved fairly 
@asily; the prceblems of n@€tworg management ard contrcl have 
Meer a eed deal less tractable. It is ironic that the 
Gompect “neehanisms fer cemputer sxsetworks, which embody 
edvanced ADP and communications technology, are generally 
rather primitive. The COINS Netwerk Control Center (CNCC), 


fem ‘exempic, has sé€veral seriovs limitetions in hardware, 
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Figure 1. User’s View of a Comruter-Communications Network 
(Reprintea frcom Reference 1) 
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Ser eware, and in Oferator assistance. The purpose of this 
Mesis iS tO study the existing CNCC and to explcre ways 
Paes VOice Input/Output (VIC) technology might be used to 
help the CNCC cperatcrs better manage the network. To 
Meerst in this rrocess, we have designed and implemented a 
Computer simulaticr of the CNCC te help us study the 
Seericadility amd teesibility of voice technology. 

We wall <irst d@scribe the COINS Network and tne CNCC. 
ve will then summarize some of the advances in VIO 
Meermelcrey cver the last several years and analyze the 
extent to which the tecknology is likely to improve the CNCC 
G@peration. #e will tnen describe the design of the CNCC 
computer model and present some preliminary conclusions thet 
were Obtained ty using it. LaS tive, we wis discuss 
implementation strategies, costs, and pctential sitfalls 


associated witb installing ViO in the CNCC. 
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Il. TEE COINS NETWORK 


A. BACKGRCUND 

The Ccmmunity Cn Line Intelligence System grew out cf 
the need OT Uae ieee hl fence agencies to share 
inScrmaticn. COINS I crigirnated as a star network with a 
Tessege-switching computer in the star’s logicel center at 
MOA. In the early days of COINS I operation, the switch 
Mepme ter Served es €- de facto Network Control Center because 
orn aeC meds TOULTCC THrouen it Enabling it to ronitor the 
Eerous Of tre netwerk (Ref. 2: p. 3-1]. COINS II, develcped 
lemme looms, uses tae ARYA packet-switching technology 
Po Crmmmect its nest computers. inere 1s, Sherer crewmen « 
momyeer a Centra: messéeze-switch to rperforr the network 
eoeeeccl Siunction. 

Peemcurtean newrome COGSi1STS of several hest systems 


(some or which are served by PDP-11 front-end processors}. 


ae 


Mees ere commected to the network by Interface Messege 
trocessors (IMPs) interconnected by wideband cormunication 
fines. fhe IMPs form eg reliable and sé€cure communications 
SuometwOTe Which uses a variety of techniques, such as 
Meaprewe rotting, to ensure network performence and 
evaileability. 

Dies a wypiceal CCINS cperaticn, a user at a terminal 


Connected tio sore host (or alternatively toa Terminal 
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Access System (TAS)) enters an interrogation request “or 
retri€val of information that resides at some other network 
eee, “Se Tetrievel Pequest is Lassed from the host to its 
IPP where it is divided inte packets approximately 18@@ dits 
long. These packets are independently routed through the 
MetWercz wntil they arrive at the destination IMP where they 
eee meassembbed into the original interrogation messege. 
Meeer reassemely, the destination IMP passes the request to 
os Tost wcompuver for aetion. The destination hos u 
meoeceeses the interrogatior by retrieving and forratting the 
requested data trom its ‘files and passing the answer, 
thorough the network, back to the requesting terminal [Ret 3: 
me I-53}. COINS hosts provide a varievy of query langvages, 
meee otro uuresS, S HETROrK protocols, and te@rminal types. 
Hdditionally, The COINS Preject Management Orfice (COINS 
PMC) actively aevelops hardware and software corponents to 
faealitate the smooth and rapid exchange of information 
DETWEEn users on diverse nosts. The Techrology Transfer 
Meeeemen Gecility (TTRE), a part of the COINS PMO, is 
eevee ls involved in frojects stch as the DARPA-sponsored 
Maa-Macnine nelationshir Program and the developrent of a 
multiple retrieval lenguage trenslator (cétled ADAPT} which 


mo wie na@ed GO Serve as a common user interface. 





SB. WEPWORK CONFIGURATION 

Semcertually, the COPNS II network car be viewed as 
COBeistimg of four independent layers cr rings as shewn in 
Bieure <. [ (Westee ring consists’ of relatively large 
COMPESeTS Uhal Surport on-lime data bases and rrovide direct 
emmebytic support to iztelligence analysts. Examples incitrde 
mee SiGiNT Cn-line Information System (SOLIS) at NSA and the 
UMPTVAC-111@ system at the National Photo Intelligence Center 
(NPIC}. The Access prccessor Ring cersists cf varicus 
Semeurer SsyStems that ellow a user tc access COINS date 
eetes Mev nOUL 2CiSae tnreugrh a host. Included in this layer 
Bee tie Terminal Access Systems (TES) which are roughly 
G@meeeeus tc very figh-powered TIPs in ARPANET terminolcgy; 
Prome-@ni Processors (FEPs) which connect hosts to the 
COWrPunicaticns suonetwerk; and Gateways, which serve as 
Biverfeces Segpween COINS [I ena other networks. The IMP 
Pome is tHe communications sutnetwork rased upon ARPANET 
Mameeosopiy end tecnnoclogy. The T1 (tetrahedron; ring is 
Memeracu. sty that limms tne IMPS together and raintains tne 


Peeure Communicetions of whe COINS II network [Ref. 4: p. 


Saeeesvre ¢, whieh portreys the current COINS MI! 
i[epomeey, the incependent layers are denoted ty different 
Sempecis: hests are represented by rectangles; access 
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Figure 2. Conceptual View of Coins II 
(from Reference 2) 
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network witn functions allocated both to individual layers 
acd tO individual components on the sane Layer. This 
maeervewes « ghGelee reliability and flexibility since the 
netwcrk can continue to operate ir the face of host, IMP, or 
ccmmunicaticn line failures. A key feature of the network 
is the use cr an aaaptive routing algorithm to pass messages 
Sem 6h6iteiP hCUtolCUdIM?PL Keach IMP is ccnnected te at least two 
Metenbor itPs. At tne time an IMP receives a wessage from 
mes lost, it a@@ci@es the proper route te use tc deliver the 
M/eccaee tO its destination based On the current stete of the 
BO LyCTz. ror exarple, to route 4 message from IMP 1 to IMP 
ey there are three lines from which to choose. vorrellive 
the direct line (line 1) would be the best approach. I+ IMP 
Mmemects Um@el line 1 is either down or heavily loadea, it 


mueeeeteose tO Toute the message indirectly over two other 


ossible routes. 
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C. WETPWORe MANAGEMENT AND CONTACL 

Manéegement of a computer network encompesses all facets 
or Operation Pocimeaing  Mmome-ranzge planning, training, 
Staffing, end bducgeting. In this peper, hewever, we will be 
foewercami sy concerned witn day-to dey management and control 
G@permeoerers rescurces. The chief prerequisite ter effective 
Sreretroma! control is tne ability to rapidly diagnose and 
meereena te sailures in the netwerk. These functions are 


pew@ormed bY the CCINS Network Controller who uses the COINS 





memes Control Center (CNCC) computer te assist in the 
momitoOring and control of the network. The CNCC and the 
Controller wil! be discussed in greater detail in the next 
Section. Eefore doing so, however, we will point out some 
Or See ezenenal protlems of controlling a distribured network 
@ae adisceuss some Of the longer range plans for CCINS network 
fanagement. 

The COINS ccmmunications subnetwork can be considered a 
Mmetriougped computation system [Ret. ¢: yp I-S], since 
message/pacxet preccessing, network status monitoring, and 
Meourpte reporting are performed independently by each IMP. 
fs Ggistributicn of functicn, while ensuring better overall 
Memewenitity and <scervice, does complicate the problem of 
Mmepeectime aba correcting j;roblems with network components. 
Pe@erevnicel distributions further aggraevetes the contrel 
meopmem. Ime CNCC wae established to address the problems 
Dueeeereeived corvrol. It serves the multiple functions of 
continually ronitcring ane diagnosing yee (0\19) Mel eg areas, 
Peestenatine cOrrective measures, end previding é& central 
Memtimee CO Wicd USErS Fey direct inquiries and corplaints. 
Bacon iMP is progremmed tc examine itself end its interfeces 
Memmegicdity and report the results to the CNCC. The CNCC 
eowlhects these (pessibly Comal web ine ) IMP reporis, 


@evermines the most likely true state otf the network, and 


rea 








peo tS )«€6thhe)6©6required)6reépair activity in the event of 
pemerted failures. 

The COINS FMO is planning to implement a sully automated 
ieee §6©6S6hhmUSEECTChCUMffOUr ctmed« Collecting, editing, analyzing, and 
feeeevine Cc. nemarcrk inagcrmatiom. The infecrraticn will be 
fees, )6hLCO OONTOr the crperetions, performance, and utility of 


eee COINS Network. This etfort, known as the COINS Network 


“? 


enegement System (CNMS), is planned to te implemented in 
Smeees Over the mexzt five years. This paper is intended to 
Geeplement thet effort by exploring some remediel steps thet 
can be taken tc irprove the performénce of the existing CNCC 


Mmet later could be incorporated into the enhanced CNMS. 
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Tit. gteCOINS NETWORK CCNTROL CENTER 


A. CONFIGURATION AND FUNCTIONS 

AS noted atove, the CNCC maintains a dynamic picture of 
the status of é]11 network components (IMP’s, FEPs, INIS, and 
lines) and is thus able to direct recovery rrecedures in the 
event Ont ccmponent failures. The CNCC machine, a 
Bomeywell-SiG computer, is attached to the subnetwork ty 
means of a local host connection to the network’s master IMP 
(IMP 1). The operation of the remote IMPs can be controlled 
either ‘Yrom the CNCC or directly from the Master IMP. The 
Come nol Cemter operator is responsible for both machines. 

The CNCC uses twe teletypes (Kncwn as the Logger TTY and 
the Summary TTY) as primary output devices. The system uses 
the Legger tc print event-oriented messages as they occur. 
Logger messages are often concerned with alert iaformation 
Pettaining tc the status cf network ccmponents and will 
Preaguestiy require action by the Network Controller. The 
GNCC has an alarm vox wired into the logic of the Lcgger TTY 
which keys on the presence of a sentinel chéerecter (ASCII 
contrcl-G cr BELL) in a Logger message. Thus, by changing 
Tee tert of the Logger mesSages, any message can De made to 
set off the alarm box and alert the operatcr. The Summary 
evemepe 915 used to print periodic reports and elso serves 


aoeome primary command inrut device for the CNCC operator. 





The CNCC can also communicate with other COINS hosts such as 
the UNIVAC 494 host at NSA. These oer =O — ois t 
communicaticns are used primarily to transfer network status 
end summary files for further procesSing and reporting. One 
of the fundamental otjectives of this thesis is to evaluate 
me Se Of vOolce input technology to perform the input 
oumewaom of the summary TTY and voice output technology to 
pepiorm the acticn-alert functicn of the Logger TTY. 

The CNCC program is driven by three sources: (1) reports 
fem, whe network IMPs, (2) a real-time clock, ard (3) input 
commands typed on the Summary TITY by the operator. 

Bach IMP sends two reports tc the CNCC computer at least 
once every Se seconds. The first of these, the status 
Meport, provides information about the cperaticnal condition 
of the IMP itself, moaens, and coOnnected hests; the second, 
ae throughput report, provides data throughput information 
mm terms of both words and vacxets for all modems and hosts. 
If these reports suggest a change in stétus of any netwerk 
cOmponent, the CNCC will output a message on the Logger TTY 
that may require acticn by the GNCC @verator. The 
Seaumstical information is accumulated and outEut cn the 
Simmeany Tih either automatically, in respcense TO an 
meme rupe from the realtime clock, or on démand, in response 
to a command trom the Network Controller. 

Mee GHCC srogrem accepts commands typed by the operétor 


Coeeeune oummary ITY and performs e variety of functions to 
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wemver did control the network ccnfiguraticn. These 
functions include reloading or resterting an IMP, testing a 
mest cr modem interface, cr selecting one of the statistical 


Summary reports for display. 


EB. ROLE OF THE NETWORK CONTROILER 

The COINS Network Cortroller is responsible for overell 
Operation of the network. furit Laportantt. tO, 10te etihat. 
although we often refer to the Network Controller as vee 
operator , his duties are considerably broader than those of 
Se computer operatcr in 4 traditional ADP center [Ref. 5]. 
The Network Controller’s role includes the coordination and 
monitcring of netwerk perfermance, production cf netwerk 
Status reports, diegnosis of failures of network components, 
@te @irection oF recovery rrocedtres. Given the distributed 
Pemmre Of the COPNS I1 network, the Controller hes first- 
OPGer responsibility Yor retwerk operaticns. The major 
Gueees of the ‘Network Controllér are summerized in tris 
Saevron (fer. S$: pp. I-12 - I-11]. 

Trewivewrork Controller is responsible, first of all, for 
Opewatine the CNCC conrplex. This includes the lLoadineg, 
Starting, and normal operation of both the CNCC and the 
Wasver PRP programs. PeerOurn “ln this thesis @we are 
prifarily concerned with the CNCC H-S1E computer, the CNCC 
Network Controller is responsible for a much larger hardware 


suite. Other computers in tne CNCC complex include the 





PDP-11/70 Terminal Access System (TAS) that intertaces 32 
user terminals to the network; the Network Service Hcst 
(NSE), another PDP-11/72 system used fcr research end 
develcpment and network software suppert; and several other 
special-purpose computer systems (Ref. 6: p. 2-4]. It is 
planned that ultimately the Network Ccntroller could meniter 
ali these systems from a single integrated orerator console. 

The network monitoring activity ccnsists mainly of 
observing the CNCC Logger reports and properly interpreting 
Gamer @ad Tallure indications. Duties include coordinating 
merwork Status with the COINS PMO, scheculineg of special 
network tests, sawmering performance and througnput 
Statistics, verifying the programs at the network IMPs, and 
bme coordinating any needed maintenance support. aS 
Network Controller is also responsible for monitoring the 
CNCC hourly summary reports, requesting additional ré&ports 
7? smecessery» and off-loading report files end network 
Statistics to the TIPS1 system at NSA for further rrocessing 
and analysis. 

Tee most important duties of the Network Controller are 
Riese “eat ateect the day-tc-day cveration cf the network. 
These include diagnosing network or user prev lenms, 
conducting network WesSvom lO measure Deril crmapce, said 
releasing new software to the network IMPs. The Ne@tGworez 
Contreller’s fundamental responsibility is tc keep all 


Pomwore components operating at a level thet assures users 
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of reliable and timely service. Additionally, he serves as 
the focal point fcr questions, problems, and complaints 


about all aspects of network operetion. 


CG. CNeC INPUTS AND OUTPUTS 
1. Summary Teletype Output 

Puc mess@aees primed on the Summary TTY contain 
network informaticn that has teen compiled by the CNCC 
COmeuter from IMP status and trouble reports. Summary 
MemertS well into three general categories: those that 
meovrmee information on the current status of network 
compenents; these that provide accumulated summaries cf 
Status information; end those that provide accurulated 
summaries of host and line throughput[Rer 2: Appendix RB]. 
The Network Controller can select any of these reports ody 
entering commands at the Summary TTY. in. ddditicon. 
eccumulated summaries of throughput and status information 
are OUTpUL automatically every four or eight hours. fhe 
apprepriate operatcr commands are described beicw. 

Twere are two current status rerorts: QUICKPRINT end 
Meer STATUS. The QUICKPRINT, a Sample of which iS Shown in 
Meegere ¢, uses one-Character codes to describe the current 
peers of all IMPs and lines ir the network. This report 
@ise displays amy unusual situations (e.g., sense switches 
me. OBER IDS ENABLED, etc.) The following status cede 


@efeeitiens are used in the QUICKPRINT report: 





Meco BUGUST 5S 1vel QUICK SUMMARY 


Paoeceredo Tels 
tas aes tata, 
LINE 


TP 1 So 4 ON 
Voc GaN ON 


Figure 4. Sample QUICKPRINI Report 
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’ - status unknown 


3t 
1 


down 


foes DEUS Side of iine down 


[eo —rnus Sige Of line down 


Pacers Sidesor Line looped 
© > Smemus side of lime lcecped 
SB» - both sides of line looped 


mee eH@iT STATUS reyort gives the current status ot all hosts 
Pe “time network or of cnly these hosts conné€cted ta a 
apeemiirc IMP. “A sampte HOST STATUS report is shown in 
Pigure 5. 

Mme CNCC accumulates Status information on all IMPs 
and lines cnadaily basis. This information is displayed 
mee ume STATUS report. Status is reported in terms of 15 
Mnure segments such that an entry is printed for a time 
period only if the status of one of the components changed 
Ourime that interval. The STATUS report emplcys the same 
Set ef codes used in the CUICKPRINT with the addition that, 
me me Status of any IMP or liné has charged auring the 
interval, its status will be shown as “X'. A Sample of the 
Peels Report is shown in Figure 6. The IMP and line STATUS 
peperts are automatically summarized and output three times 
Aa day, at @&8@2, 1600, and <400. 

MeawoucC complles throughput stetistics cn the 


emount of treffic transferréd between hosts end over the 
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poco QWs? 5 1981 HOST SUMMARY 


IMP 
123456 
IMP i: MOST @ (VIRT=HOST 1) UP 
EOST 1 (VIRT-HOST 7) DCWN 
ECST 2 (VIRT-HCST 12) DCWN 
IMP 2 EOST 2 (VIRT-HKOST 2) UP 
POST 1 (VIRT=HCST 12) UP 
BOSH 2 (VIRT<“HOST 13) UP 
HOST 3 (VIRT-HOST 14) DCWN 
IMP g HOST @ (VIRT-EOST 3) DCWN 
EOST 1 (VIRT-ECST 15} UP 
HOST < (VIRT-HCST 16) UP 
IMP as Bec CwmprRT—-HOST 4) UF 
IMP 5 HOST @ (VIRT-HOST &) DCWN 
SOST 1 (VIRT-HCST 2z) DOWN 
HOST 2 (VIRT-hHOST 24) DCWN 
TMP HOST 2 (VIRT-HOST 6) DCWN 
HCSR 1 (VIRT=—HCS7T 26) DCWN 
FCST < (VIRT-EOST 27) DCWN 
YOST 2 (VIET-HOST Ze) DCWN 


tay 


cn 


Seige ve hO0o7 STaRIUS 
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AUGUST 5 19&€1 COINS NETWORK SUMMARY @@00-1E00 
WP STATUS 


TIME 125456 
CdCo Cs I 


1039 XAXKXK 
1@45 . 


LINE STATUS 


a WeeeeeeeSO 1254 


ZECY Chee fe 
12SL KAXCCRAA?e PECK 


1 Oe tmngens ft ON es 
hel. io (ACME TE IT Ie 
1ES5¢ eS oe ara ae ae 
1345 ce eine doce 


Wafure S&. Sample STATUS Repert 
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Bitre> OF the Wetwork. This information can be displayed in 
either of two reports: the hourly THRUPUT summary report and 
the €ignot-hour DAYSUM report. The tormats of these reports 
ame id@wticals the statistics, however, may differ scmewhat 
eepending on the time of the reqvest. 

Ie the TERUTUT rerpert, Sbown ia Figure ae 
PerOmpmation is arrangea by IMF and within each IMP by 
physical bost address with the virtual host address (VHA) in 
parentheses. Fost threughput is reported in terms of the 
amimer Of Messages and packets sent and recé@ived for both 
mre, Site and intra-site traffic. The ?ollowing symbology 
Beolicem LO denote the information unit and rath direction in 


tee IHRUPUT rerort: 


M->N Messages sent to fFosts et other network sites. 
M<-N Messages received trom hosts at other sites. 
P-7n Peewets sent to hests at cther sites. 

P<-N Packets received from hosts at other sites. 
N=>s Messages sent to hests at the same site. 

Me=-S Messages received from hosts at the sare site. 
P—>95 Peewet>s SERt to Hests gt the same site. 

ao Packets received trom hnosts at the same site. 


Throwehput figures less than 5¢,7€7 are exact; those 
petween 22,76E€ and 2,997,151 are accurate to within S54; and 


Saerse eneater than 2,097,151 are shown aS +ttttt++. 
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AUGUST = 19&1 COINS NETWCRXY SUMMARY 1400-1529 eytea ts 


Boe HOSt 1 HOST 2 HOST 2 


IMP@1 =. (oeeesteal) ( 1@) 
M=—>n 2eUS Zee 

M<-N EY& 2962 691 

F- *N 8439 Sel 

SC raa EYE Elles EE? 

M<-S 14¢ 

P<-S 142 

IMPO< ( 2) Ce) ( 13) 
h—>N gg Oe 17ée> 117 

M<-N 17@& 1E¢8 lec 

iP >h 5672 Sol ooo 

P<-f 1E9@ 1Y¥2t era 

tm ee < (pe lis) 

M—->N cQl 

MC=N E69 

P->N 729 

EN 720 


MUGUST 2 1961 CCINS NETWCRK SUMMARY 1400-1500 PAGE 9 
Lime THRCUGHPUT 
meee)60OU se TOP UiMPe@e 6ypeaee)6 61M Oe TO UIMP2L) 611299 


Meee cs INTPYI TC IMPGE 1705 IMPZGE TO IMPG1 Tez 
mee CS TPES TO UIMP@S 6 6coeSe 0 6 6UIMPES TO UIMPES Belo 


aiipure 7. Semple THRUPUT Report 
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e. Logger Teletype Cutput 


a. General 

Mesdescription of the Summary teletype outputs 
was relatively straight forward, since the outputs were 
melli-derined and fairly predictable. Such is not the case 
meen we consider the outputs that appear on the Logger 
teretype. We recall that the Logger TTY is used for alert 
Sepe Smessages, meny ot which ré@quire sore action by the 
Network Controller. Appendix B o* Reference £ contains a 
list of 93 possible output messages that may be sent to the 
Logger TTY. Many cf them are merely informaticnal in nature 
ana ane useful] primarily tor later diagnosis and tracing of 
events. it is, therefore, neither necessary nor desirable 
mom  OUr purposes tc describe them all. Instead, we will 
present the geneéral 7orm of the Logger messages and describe 
Daese that pertain to the overall performance of the 
network. The ressages with which we will be concerned are 
@Melply thcse that will activate the COINS alarm bex to 
audibly alert the Network Controller. These messages are 
the cn@s that weculd be candidates te be sent tc a voice 
Seeput device. A fuller discussion of Logger messages 15 
feond Gn Rerenence 3. 

All Logger messages have the following general 


TOTrm: 


<time> <repcrting net comporent>: <event descr>d 


o4 





where <reporting net corponent> is some IMP or kine and 
<event descr> is a brief message describing the specific 


event. Some examples are: 


@7as (ier. “VRE =tOSsT 12 DOWN 
meee LONE €: ERRORS MINUS 5/6 


1538 Me oe: WRAP (74, 491, 1) 


In the remainder cf this section, we will describe some of 
the more important Logger TIY messages. 
Bb. Ba) HOST DATA CHECKSUM 
Each IMP has internal tatles on which it bases 
message routing and delivery decisions. Should one of these 
meobes become disturbed, for exemole tecause cf a program 
crash, this message will appear on the Logger at one minute 
intervals until the table is repaired. 
c. HOST N DOWN 
Perr ls SEecporting that physical host N has 
aeeppea its ready line. A variation cf this message is 
“VIRT-HOST N DCWN which provides the same information for a 
'hest that is assigned a virtual host number. Fecause lccal 
Mest Operation is not a CNCC responsibility, this wessage is 
Meaipiy to provide information that might be used to answer 
ser queries. 
qd. IMP Ne DOWN 
The neighbors of IMP N have reported that the 


lines to that IMP are dcewn. The CNCC has decided that the 





IMP is not available for network service although it may 
still be actually running ana able to communicate with its 
Moca fhos ts . 
e. LINE X DOWN 
The IMPS at either end of line X have declared 
eae lime dead. The CNCC makes the decision to report the 


line dead. 


re 


LINE X DOWN PLUS (or MINUS) 
The end of a line which is connected to the 
higher numbered IMP is defined to be the “plus” side of the 
line; that connected to the lower sumbered IMP is the 
“minus side. This lcgger message is similar to the 
previous one except that the IMP at only one end of the line 
is reporting a problem. 
ge 6 TRAP | OC Ty, X) 
ieee reporters IMPs has detected an dnternal 
enomaly. the T, A, and X valves provide further information 
about the trap. Usually these messages are of interest to 
Betwork scftware personnel cnly;, some of them, however, 
meOtmeesoOperator action to clear up the error conditior. 
be ©CRASH” ( T sulgah ) 
limes mesSase sindicates that the reporting IMP 
has detected a serious prcgram malfunction that caused the 
Beogram tO Crash. It has reloaded itself and the program is 
now up and running. T, A, and X contain register values 


useful to software personnel diagnosing the failure. 


56 





PD. WETHER CST=TBL Xx 

This message indicates that the serial rpnumber of 
the reporting IMP’s virtual host adaress (VHA) table differs 
from the master in the CNCC computer. Appropriate operator 
ection is required to Make the serial numbers consistent. 

owe Hetmork Controller Commands 
a. Generel 

As was the case with the Logger TTY ressages 
there are a large number of CNCC operator commands thet do 
not concern us. We will be concerned primarily with those 
that are used frequently in netwerk mecnitoring and 
troubleshooting. These are also the commands thet are good 
candidates for vcice input. Commands to the CNCC are 
preceded by the commana sentinel character “? and consist 
of a i-3 character command mnemonic followed, optionally, by 
edditional command-dependaent parameters suck as filenemes, 
IMP numbers etc. In the discussions that follow, operator 
entries ere in upper case and underlined; prcempts and 
P@sipomses by the CNCC are all in lower case. Expressions ina 
angle brackets, e.g. <FILENAMED, are used to represent ASCII 
Strings. The $ character is used to denote the ASCII 


escape character (cctal 33). Some examples are: 


iwePort Daysum: 


7eProe@cast Tile: <FILENAME>S imp: <#>$ 





CNCC commands can be grouped into three broad 
categories: commands us@€d to produce summary reports; 
commends used to manipulate local CNCC files; and commands 
used for remote transmission of files between the CNCC and 
Other network components. For a complete list of CNCC 
cemmends, see Appendix E of Reterence &$. 

db. Summary Revert Commands 

1O reguest a specific summary report, tac 
Gemercllier types ?REP and the first letter of the report 
type (e.g. D for DAYSUM report). The command is terminated 
iy typime acoton. Theefollewing are the ccmmands “fcr the 


various summary reports: 


TREPoOFt Quickp: 

@ea@Port Host status: 

FRR Ort Host status imp:<#>$ 

SREPOrt Status: 

meeort Paorptsum: 

?REPort Daysum: 

e@- Eoecal £1le Commands 
Decal fie cofff™ands allow the operator Lo 

perform powwrme op@rativons on klccal CNCC Tiles. The 
following @xamples illustrate the types of commands 
available. for a mere ccmplete description see Secticn ill 


oT B@ference <. 


o& 





ferisot Directory 


@Print Total (disk space available) 
?REName (old) <FILENAME>S (new) <FILENAMEDS 
d. Remote File Transfer Commands 
The commands used to transfer tiles between the 
CNCC and cther components in the COINS network are prebdably 
See MOSUt importamt of all the operator commands. igual 
mpmemweae the Netwerk Controller with a means of reloading 
wemnose IMPs and correcting other network problems. One of 
the most frequently used commands is the EROADCAST command 


marcm Has the following general form: 


?Broadcast file: <FILENAMES imp: <#>$ 


where <FILENAMED determines the specific action and <#> is 
eee Lumber to which the fileswill be sent. Numerous 
examples of the use ofr tniS command can be found in 
Reference 5. 

emconmecwecertain types of checksum errors, the 
Controller uses the RELOAD command which has the following 


eeneral form: 


?RELoad file: <IMPSYS ####BIN#ADS 


modem: <#>$ imp: <#>$ 


where <IMPSYS #### BIN ##> is the name of the latest version 


Oe umes WP prcegram and <#> rezvers to the modem rumber and 
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the IMP number of a neighbor IMP to the one reporting the 
GCAECKSUM Errors. 

On Occasion, the operator wants to transfer a 
file from the CNCC disk to another COINS host. Cften, these 
are statistical files sent to another host for follow-on 
peocessine. This function can be accomplished by the SEND 


Gemmand Which he's the following form: 


?SEnd local-file <CNCCFILEDS 


to host <#>$ remote file name<REMFILEDS 


Mere <GNCCFILE> is the name of the CNCC Yile to be 
transferred end <REMFILED is the name that the file will 


have in the file system of the remote host. 


D. LIMITATIONS OF CURRENT CNCC CONFIGURATION 

me Cuerentay configured, the CNCC has a number of 
SmorLtccmumes that make it more dit?ficult for the Networs 
weomerolver to perform the basic functions of network 
momiterime and control. Altnovgh net all cf these problems 
eee Permeme TO the current inquiry, it is worthwhile to 
([rocwss Cech of them ovrierly to put the existing CNCC in 
perspective. 

mee fost oovious limitation is the relatively primitive 
Donomecne in the CnNCC. The CNCC computer has only 32k bytes 


cf primary memory and cannot be expanded; this places 4 


severe constreint on its ebility to manipulate and enalyze 
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Betwork Status information. ieee wis-- lostmri cient disk 
peomeee tc save all the raw data that is sent to the CNCC. 
As a consequence, data must be summarized before it is 
amemea With a resulting loss of information. The CNCC 
cutput devices are quite slow and it is possible to lose 
tratf@ic in peak situations. bGeis Wworthenactine that a 
number of COINS hosts have completed major hardware upgrades 
while the CNCC hardware still represents the technology of 
mee early 1970s {Ref. 2: p. 3-3]. 

iere are similar problems with the CNCC software, both 
with the analytical routines and with the procedures that 
Paremreace directly with the operator. Many of the output 
Mmecseges are cryptic and require considerable experience and 
emesswork to interpret correctly. The py t command 
Momsveee is alsc not very human-oriented. Some of the 
Semeechive procedures are cumbersome and trcene to error. 
fOr Cmample, the procedvre for relcading an IMP can 
sometimes as many as eight fairly complicated steps [Reft. 2: 
me Se]. Huranized, self-contained output messages and 
Simple macro type inpvt commands would go a long way to 
Seeeweme Tee job of the Network Contrecller. 

One of the most serious problems facing the Network 
Controller is just the number o? individual consoles that 
Must De monitored. as notec aoove, the CNCC operator is 
résponsible for the cperation cf several computer systems in 


addition to monitoring the Logger and Summery fTTYs. While 
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these systems are all in the sare general area, it is not 
possible tc monitcr and operate ail the systems effectively 
without a considerable amount of physical movement. One 
Study showed that, in the course of a typical week, the CNCC 
cperator made 802 separate trips between various positions 
mee cCewerea a distance of over five miles. More significant 
than Gola lL distance 1S the fect that “whe tregvent 
Maeerruptions Make it hard fcr the Controller te maintain 
Seeeecrve continuity over all the positions. He will too 
c¥ten be at the wrong conscle at the wrong time and thus 
miss some important messages. 

These shortcomings cannot be eliminated by a riecemeal 
approach. Ultimately, the CNCC will heve to be replaced by 
@ Mmoad@rn computer system that enhances and sirplifies the 
job of network management. Plans for such an upgrade have 
meen developed by the COINS PMO. A turnkey replacement 
system is, hewever, several years away. Advances in Voice 
Input/Output (VIC) technology over the last several years 
prcevide the potential to make some impertant interim 
improverents et comparatively moderate cost. In the 
Pemaimaer cr thisS paper, we Will take a clcser lcck at voice 
technology and examine waysS that it might be adepted to the 
Seist ram System to increase the effectiveness of tne COINS 


Network Controller. 
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IV. VOICE INFUT/OUTPUT TEHCHNOLCGY 


A. GENERAL 

BS Weemoted in the previous secticn, several cf the 
problems faced by the CNCC Neé€twork Controller could be 
Cateforized as shortccmings in human factors engineering. 
Ome Way of improving the human aspect of any system is to 
Simplify the procedures used to communicate the intentions 
of the human cperator to the system. Another is to provide 
embetter Way of motifying the operator that a significant 
event that requires his attention has cccurred. What we 
Bild attempt todo im the remainder of this thesis is 
explore and highlight ways of achieving these two 
fundamental improvements to the current CNCC system. On the 
input side, we will examine the uses of Autcmatic Speech 
Recognition (ASR) equipment; on the output side, we will 
Gomsu@er tre use of voice output techniques, such as speech 
synthesis. Input and output will be discussed seperately in 
mere @c@pall in 38. and C. delow. in the rerainder of this 
section, we will outline the overall approach of the 
investigation. 

‘as approach tyaditidcnally used to evaluate the 
Pore@uitzal of ASH for a particular application has been to 
conduct what are called (Ret. 7: ov. 4] “evaluation 


experiments . Svaluation experiments are the most rigorous 
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wee OF CXPETIMent and usually involve careful control of 
experimental conditions, a number of replications, and a 
formal analysis of numerical measurement data. Reterences 8 
and Y are good examples of the use of evaluation experiments 
mo G@etermine the usefulness of speech ré€cognition in a 
Farticular application environment. There are a number of 
advantages tc this approach, the most obvicus being that the 
conclusions are bolstered by hard statistical evidence. The 
G@rseaavamiage is that it is often difficult and expensive to 
develop experimental models that realistically portray 
Real-world situations. 

For a number of réesons, we decided not to use a formal 
Ewabuation experiment for the CNCC study. First of all, the 
CNCC is a complex man-macnine system that is extremely 
eer vert to model realistically in a controlled experiment. 
Events in the retwork are not deterministic and by 
Tavrouucing the controls necessary for a formal experiment, 
meoemen tee risk oF building a model that does not accurately 
pemeeras, the real system. Related to this is the difficulty 
Of obtaining people with adequate background to serve as 
experimental subjects. CNCC operators were nct available 
ema eren subjects with extensive computer experience would 
meme to have bengtny svecialized training to participate in 
Swen 48 G&iperiwent. The fact that we were looking obdoth at 


Meree” “ert and voice outrut was an additional complication 





Maat could necessitate additional replications and hence 
Pperease the cost of the experiment. 

A MWOTe positive reason for adofting a different strategy 
was that a formal experiment was really not necessary. 
Automatic Speech Recognition has matured to the extent that 
we can feel reasonably confident in extrapolating results 
from earlier experiments. One of the landmark experiments 
in ASR involved a group of 24 experimental subjects whe used 
Voice recognition equipment to verbally enter commands to 
fee ARPANET (Ret. &]. The subjects carried out a predefined 
Seemario using both voice input and typing input. The 
Subjects were also required to frerform a secondary task 
during any free time that they had. The results of the 
experiment [Ret.8: p. 2] showed that with only three hours 


ee “fanning prectice: 


== Woice imput Was 17.5% faster than manual typing; 
—“siameai taping input had 183.2% rore entry errcrss and 
——"soice imput allowed subjects to complete 25% more of 


the secondary task than did manual typing. 


Memes eu@ar tnat there are a number of similarities 
between the scenario in the ARPANET experiment and the 
Opereatviomal Environment in the CNCC. The CNCC operators are 
also involved in entering ccmmands tc a distributed computer 
network that uses ARPANET technology; the operator is, of 


course, always perfcrming a seccndary task since the Netwerk 





Controller is continually occupied with responding to output 
requests on a number ot aisplay consoles. We believe, then, 
that the general ccnclusion cf the experiment, thet it is 
feasible Lo use current commercially available voice 
recognition equipment to run many operations of an ARPANET 
type network, can also be applied to the CNCC. 

In terms o? vcice output there is less hard experimental 
evidence of apolicability. There are some examples, 
however, where voice response units have been used in plece 
of traditional alarm devices; Oe Ser lin eh eal lent vy 
attractive, furthermore, that voice cutput would provide 
Gomsideérably more information than a buzzer or bell. 

The relatively moderate cost of current veice input and 
Output equipment is alsc a factor in deciding against a 
full-scale evaluation experiment. noe One das there As 
Meesonpable evidence that the technology will be useful, 
moe 1s £Airly lcw econcric risk in trying tc apply it. If 
meoerbaactc apelicability can be verified; the feasibility 
demonstrated; and one or more pcssible implementatior 
alternatives developed, there is a high probability of 
Successful installation and overation. 

Ome amet ncde Gey, therefore, has taken two different, but 
related, paths. First, we have researched the field of VIO 
technclogy to understand the capability and cost of the 
available equipment and its potential applicability in the 


eueo. Second, weehnave inrlemented a tréining simulation 








model of the CNCC that is useful both to demonstrate the use 
of VIO in the CNCC and to permit experimentation with 
alternative implerentaticn approaches without interrupting 
the production system. he model can also be used. to 
familiarize CNCC operators with voice technology equipment 
befcre actually installing it on line. Besed on these two 
parallel efforts, we then aeveloped some strategies for 
imsteallimg YViO in the current CNCC witnout major 


modifications to any of the existing hardware or software. 


B. VOICE INPUT 
1. Typical Applications of Voice Input 

Until the mid-1970s, autcmatic speech recognition 
(ASR) was used primarily to develop voOcoders for narrow-bana 
Speech ccmmunications. With the increasing availability of 
high quality, roderetely priced systers for speech 
Deeemno ite On, however, the number of spotential and actual 
applications has mushroomed. Limited vocabulary vcice input 
Erstems Dave moved owt of the laboratories and are now 
Gperating in a variety cf applications in bcth private and 
pple Secuors. #xamples include automated sorting systems 
mom Sime distribution of parcels, containers, ard baggage; 
voice programming for machine tools; quality control and 
miemeervon; air traffic control; entry of cartograjhic data; 
and aids for the hendicapped [Ref. 12: pp. 182-186]. While 


tais large number of yotential uses precludes detailed 
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meoeuoslOn Of them all, it would be well to briefly discuss 
a few of them before we try to generalize about the 
Characteristics of applications where ASR is likely to be 
puceessful. 

Voice input devices nave been used in material 
handling applications since the mid 1970s and have resulted 
Dae rmerTeased operator productivity and reduced error rates 
over their predecessors that required operators to hand key 
foe sortase destimnatron codes. A typical emample, is the 
voice control package routing system developed at $.S Kresge 
Palos. 8S percels arrive at the induction station, the 
operator simply stetes tare destination cede for each package 
into his microphone headset. The computer recognizes the 
Spexen destination code and generates the apprepriate code 
as if it had come from a keyboard. This “sorting vy speech’ 
Mmereased productivity and eliminated ea serious problem of 
“punching of different size packages at the induction 
Bhatdon (Ref. 10: r. 185]. 

A longstanding Devured cen in Sciplenravaseda 
Manufacturing control systems has been the delay in getting 
Machine tocol programming requests translated into working 
software. Taeettvonally, factory personnel would identify 
necessary modifications in the form of drawings; these would 
be converted to specifications; translated into a computer 
Proeram manually; converted to tape; and, finally, installed 


mreeeoenme machine control unit. A form cf voice programming 





has been developed that allows tactory supervisors to 
girectiy comtrol the computer-based prcecesses. Voice 
programming for numerical cortrol (VNC) allows the factory 
personnel to speak commands in stylized English that are 
automatically translated into a computer-compatible format. 
No Special programming Skills are required and the 
“programmers hands are free to handle dluevrints or perform 
Supporting calculations. VNC has transformed what was a 
Seven Step process fraught with the danger of error that 
creeps in whenever complicated human communications are 
required into a workable four step process under the direct 
control of the people using the results [Ref. 10: pp. 1&&- 
186]. 

ASR has not been used as extensively in the public’ 
sector although experimental results and linited overational 
experience seem quite promising. As noted above, Foock 
(Ret. 8] has demonstrated that using voice input to exercise 
a typical scenaric on the ARPANET was significantly faster 
and more accurate than manually entering tne conmanas. It 
-has also beer used operationally at CINCPACFLT to access a 
Deamee intelligence data base and has been found to have 
Gonsiderable petential for use in the producticn of imagery 
interpretation reports [Ref. 9: p.9&]. A Significent amount 
of research has been performed for the Deferse Mapping 
Agency (DMA) on ways to use ASR in such applications eas the 


mrocessifig of Digital Landrass System (DLMS ) data, 





PeeraretioOn “Or Flight Information Publications data, and 
oceaen-depth measurements for Grerti zed Car toerap hic 
applications. These cperations all take place in an an 
environment where both hands and eyes are continuously busy 
and frequently involve the use of stereo optics or other 
special equipment. Voice has been shown experimentally to 
m tastem €asicer, ard less fatigving to the operator than 
fore traditional methods of data entry [Ret. 9: p. 327]. 

Lest this discussion appear too one-sided, there are 
mey applications for which voice output is either not 
desirable or not ccst-eftective. It has been shewn tc be cf 
ferrvea wtilityvy for the preparation of percformra messages 
oma, tO BE no Taster than typing for entering commands tec 
the warfare Environmental Simulator (WES), e wergamre used on 
ene AOVanced Command and Contrel Architectural Testbed 
(A@@A) . Becamoe of limitations of today’s te@Hnoloey, 
Merri cantons tidt require speaker iIndevendence, vocabularies 
Preever “Gian about £30 words, or récognition of continvous 
Sweeenmare MOt practical. From the amcunt of experience we 
Dawe, Bead to date, however, it is possible to identify 
Cepwern Shdracteristics thet Make an applicetion a good 
candidate for voice input. we well discuss tmese 
G@areeterastics in the next section. 

2. semrerreerePistics of Candidate Applications 
me view of Qhe erowth in operational use of voice 


input it should vbe possible to identity a set of task 
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Situation characteristics where the use of speech 
recognition is likely to be beneficial. Reddy [Ref. 11: p. 
6@] presents such a list that he adapted from an earlier 
study. Using this list as a starting point, we can define a 
Bpeetile of Chamactveristics that make a particular 
application a g00d candidate for ASR. It is unlikely thet 
Meany Operational situations will exhibit all of these 
s@atures, but the existence of Even two or three provides a 
Strong suggestion thet the use of speech input should bde 
Pramined. 

ome simemwe characteristic that almcst all of the 
successful applications have in common is that they all 
involve situations where the operator’s hands and eyes are 
Saceady busy with other ‘functions. In many operations a 
Geod deal of productivity is lost when the operator has to 
interrupt his cther activities to input data via a Keyboard. 
fomce input provides an PIGempenaenL, high-bandwidth 
Communication channel that is tailcr-made for hands~-busy 
operations. 

Memeated tosthe linst consideration is the condition 
where the operators must frequently move away from the basic 
PBEUt Station to attend to other cuties. Wireless 
transmitters the size of a cigerette pack can previde a 
considerable range of operation permitting the operator to 
Communicate with an ASR system without returning to the 


input console. It is interesting to observe that the use of 
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voice input in such situations might suggest, or even 
necessitate, the use of some type of audible feedback system 
(i,e. speech outyut) since the operator will not always be 
in the vicinity cf the speech input device or its associated 
teletype or CRT. 

Some applications involve what might be called 
“virtual movility requirements, in addition to or instead 
or’, the * pgs teal mobility just described. For example, it 
mae ee Hecessary tO access a number of different computer 
systems through a common terminal or console. The ARPANET 
experiment provides a good example. In the ARPANET, the 
meet COMmpuUteTs use different command languages, Ce ay 
languages, and file management systems. This heterogeneity 
can sometimes te confusing to even experienced users. For 
example, the UNIX commana to delete a file is ‘rm’; while 
the comparable DECsystem-2@ command is “DEL”. To display a 
directory of files on UNIX, the user types “ls -1°3 on the 
DEC-2¢, he types “DIR°. Well-designed voice input systems 
Can alleviate tkese problems, to et least some Extent, by 
making these different command languages transparent to the 
operator. For example, one widely-used system allows for 
Eaewderiortion Of voca@¥ulary sets that can provide a mapping 
between ¢tandard input phrases "and dirtfrerent “output 
Cheracter strings aepending on the applicaticn context or 
Vee “particular host computer being accessed. In our second 


exarple apvove, the operator might always utter the phrase 
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‘display directory” and the voice input system, in 
cooperation with the hcest computer, would perform the 
MeerstOrmation to the atpropriete character string. Voice 
imput can also reduce complex connection protocols to one or 
two natural fhrases that make it eesy to establish logicel 
Gompections. 

Voice input systems are ideal for use by untrained 
Oeers- NO physical skills, such as typing or other keyboard 
manipulation, are required. The naturalness and humanness 
of the speech input interface make it applicable for users 
wee. weeeral skill lewels: ‘trom clerical personnel, to 
computer operators, to top-level manégers. 

Operations where speed and accuracy of comtrunication 
mee €Ssential are alsc excellent candidates for voice input. 
In the ARPANET experiment, significant improvements (see 
above, in the speed and accuracy otf data entry were attained 
Seve thoveh the test subjects had no previous experience 
were "Foice input. some studies have fourd that peak 
efficiency is not attained until the cperatcrs have had 45-4 
memtnhs of experience. This suggests that even greater 
PRCduUcCTIVAaty gains can be realized. 

Because of technological limitations, which we will 
bursve im-detail in the next section, candidate applications 
shovld heve a relatively Miimited input vocabulary. 
Contemporary yoice recognition ey>tens will support 


vocebularies of from abdout EG to over 25d discrete 





utverances or phrases. Applications with a well-defined but 
limited command set are the most likely candidates. Most of 
the vocabulary should be capable of being “canned”; a 
language that contained many fields with arbitrarily varying 
imput would, therefore, mot be very suitable. For a related 
mechnclogical reascn, applications with an input language 
Meade up of diserete commands and fields is much more 
adaptable to voice input than one where input is in the form 
oe wnstructtred character strings. Fer example, current 
technology would readily support an informaticn storage and 
retrieval system using a fcrmatted query language but would 
be inappropriate for a word-processing application. 

Mag ioe deétimed this profile, it will be instructive 
to compare it against the CNCC operation to see how well the 
CNCC snapes up as a voice input candidate. Figure & lists 
Mee Si@ Characteristics and a subjective assessment of how 
mertl @&@cnh applies to the CNCC. Wwe see that, with two 
Preeeurlows, all the features are present in the Netwerk 
Monmvrol Genter. Tne Networ* Controller is often busy doing 
something else when he is calied upon to input commrands to 
mn soem, the ne€ea to control a number of systers clearly 
Pecunres Pay sued | mobility, the MesSwoOne 1 Dilly y 3803 a 
Teintdining an operating CCINS network be qures that 
accurate information te entered quickly; and tne CNCC 
command language is small ara reasonably well structured. 


Mme GNCC verscnnel are adept at entering ccmmands and data 
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into computer systems so the benefits to untrained users are 
net really relevant. 

mee OHlyY Uncertain area is that Which we heve 
called virtual robility. There is no question that such a 
requirement exists in the CNCC. Given the number of systems 
eer wWoich the Network Controller is resrfonsible even a 
Bimeeeed ameurt. or ccnnection-protocol simplification and 
command languege transparency would be quite beneficial. 
Pee g@i-ticrlay is that voice input can only help if the 
Systems in quvestion are already connected electrically and 
Paey Mave ah agreea-upon rrotocol tor connection, even a 
basic timesharing protocol such as ARPA’s TELNET. It does 
mol sepptar thet such electrical connections and protocols 
Smlst today, e.g. there appears to be no methcd of accessing 
mae OUMer mechimes in the CNCC complex directly from the 
Summary TTY. Nonetheless, in view cf the cverall assessment 
Se@en in Pigztre €, the expectation is that voice input can 
Pe of considerable benefit to CNCC ojyerations. 

Se. “Overvicmw of Voice Input Technology 

AutOMatic Speech Recognition is a subset of a 
breader intellectual domain known as Speech Understanding. 
Sreech Understanding systems (SUSs) have the objective of 
preperly interpreting the intent ct a speaker even when the 
Soeeen is not grammatically correct or well-formed. SUS S 
meet, wer @C@xemple, take into 4&ccount the tendencies of 


Speakers to talk iz incomplete context-sensitive sentences 





end Lmmoputme Uitberances with occasional non-speech 
elenents such as uhs , “Ya knows , etc. It is important to 
mote that SUS are not so much interested in correct 
Becoputrion of every word but rather focus on the reaning or 
Peeare conversational segmerts. 

Even when the domain is relatively restricted, 
er menvims Whe use oF task-specific information, the job of 
“™ eepsech wnderstanding system is truly formidable and 
ewrOrts tO @ate Have peen rostly ot acad@€ric interest. In 
Bee Seeeking process, a speaker transforms concepts into 
meeecia tbrougo processes that introduce variability and 
mceise. In an attempt tec understand the concepts, the system 
Must déal with phonemic, iexical, syntactic, and semantic 
Hewiemowor meaning all of which are susceptible to the 
Mimmoovct+0n of additional noise or error. In the words of 
Bieeeesieners cm one cf the pretotype SUS, 

me cOmprenend an utterance in the context of such 
Sprors, “a speech understanding system must formulate and 
Cvaluate numercus candidate interpretations cf speech 
IreeMemts. Um@erstemding a Message requires us to 
Reomane ema recOznize its individual words and parse 
U@eir syntactic and ccnceptual relationships (Ref. 12: 
Oe. 2s i. 

Mee 20els cof Speeco Recognition, in contrast, are 
muicm less ambiticus. Instead. ‘Of dealing with ebstract 


concepts like meaning and understanding, speech recognition 


Systeme attempt to solve the more practical problems of 
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analyzing the acoustic waveform and applying pattern 
recognition techniques to differentiate between utterances. 
ASR systems can be considered as belonging to one of 
twO categories: continuous (connected) speech systems or 
isolated (discretej speech systems. (Scme authors draw a 
eistinction retween continuous amd “Compected ““speecen, 
peperdineg thre latter as a somewhat simpler preblem. In this 
maes is Mae temms ere usec€ interchangably.) While “the 
distinctions between the categcries are often obdlurred, 
discrete systems are those that require a short pause (on 
the crder cf 102 to 2@@ ms) between utterances; Connected 
eos, On the other hand, can recognize words without 
Seeplicit kmpewhedge of their endpoints. Continucws speech 
Beceenitiow is considerably more difficult than recognition 
ee Gwccrete utterances. First cf all, such systems must 
e2eempose the acoustic waveform into phonemes and words at a 
real time rate. Thisc naturally requires a great deal of 
corputationéel fpower. Seconda, the ecoustic varilationmof 
memes smpekenm in comnected speech is different than when the 
words ame spelen discretely. imas  1Saeecals Cdaoy Seaae 
Gere Seulatiom of neighboring sounds, i.e. the positions of 
meeeeeoem@eue, Sav, and lips in a speech sound are affected by 
See previous end future positions. COMUINWONS Ss rECOgn rE Zers 
[iveteunenerore, be very sensitive to converséetional context 
Mets 132 p. 27]. There are some  connected-speech 


Me oeemazers On tne market toaay but they are expensive 





($5¢ ,@@@-$12¢€,22@) and their capabilities have not really 
Becn EVeluated. For the remainder of this thesis, then, we 
Beri CONS re Our discussions to discrete recognition systems 
meecre are commercially available in price ranges from as low 
Bs 30@8 upwards te about $15,¢0e@. 

There are two otner limitations with most of the 
eonmvemporary systems. Poeetirstwdedls with fimitations on 
vecatulary size, the seccnd concerns speaker-independence. 
Oeopmercially available systems support vocatularies of about 
49-256 discrete utterances or words. (In speech 
recognition parlance the terms word and ‘utterance are 
used interchangeably, even though the term “phrase might be 
more appropriete since utterances ere often corposed of more 
Daan one word. ie acueiie OboemeoSt. SySterms are sSspeaker= 
@ependent, i.e. they require a user tc first train the 
Peastem With his voice patterns for @ach or the utterances in 
the vccabdvulary tetore using the system. (An exception is 
See tyce of recognizer that supports only a very specialized 
vocabulary, e.g. tne 18 digits;. Fer most applications, 
memmmerrestriction has much practical effect. Very few 
Computer—-related apn licatlons require more than £80 
utterances and training is usually accomplished quickly and 
easily. 

Figure @ shows @ block diagrem of a typical discrete 
WwOTd Preca@n1tion syster. While the aiagramr and _ the 


Miscussion to follow describe a particular implementation 
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Meeture 9. Block Diagram c% Typical Speech Recognition System 
(From Reference 12) 


ED 





mescerroec by Thomas B. Martin of Threshold NeGRROLOe yy, Inc. 
(het. 10: pp. 17€-175], the general appreach applies to most 
of the systems available today. As the diagram shows, the 
system takes a spoken utterance, attempts tc match it with 
eee 6oF rtS©6CPFE-StOTed voice patterns, end, if successful, 
See rues a pre-defined string ot characters over a standard 
M-e-- interface to eae host computer. In our example, the 
spoken phrase Clear Crash Imp 2° generates the output 
character string ?ECLEAR CRASE<esc>3<esc>” which is the 
proper cperatcr entry at tae Summary TTY in response to a 
numroer of Observed network problems (e.g. IMP CRASE 
messages, PACXAGE = n messages, etc.) Tints example 
Sc e@otreses a number of important points about ASR. First, 
the spoxen utterance is considerably more natural than the 
typed commerd string. Second and probedly rore important, 
mee voice input device is completely transparent to the host 
computer (in this case the CNCC machine). As far as the 
mOSt is concerned, it is getting character <‘trings in the 
Same code and 4&t the seme bit raté as if they had been typed 
Gaauelly. Thus no host sottware modifications are required 
moeimme cet. nis is critical in light of the complexity of 
eee ewenmee 6program, the limited primary merory, and the 
Pcncsequent difficulties in raking software changes. 

we see from Figure $ that e system consists of four 
wmeeme  CONWDOEPNOLS: a Mlcro,~none transducer, a freprocessor, a 


MeeUlIre Extractor, end afiral decision level classifier. 
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Farly speech recognition systems were weakest in terms of 
Teature extraction, often deleting the function COnpietely 
or Vsite -2 rudimentary form of template fatering which 
Eroved inadequate and was soon re‘ected. 

The micrcephone, of course, is the interface between 
eee ester and the system and converts the sroken phrase into 
arn electrical signal that can ode analyzed by the cther 
components. The preprocessor perfcrms a number of functions 
Oeeemecwebectrical stemal. It normalizes the signal in time 
SO thet it can be léeter compared with the reference 
batterns. Dynamic frogramming is one technique used to 
echieve this time adjustment or warping. The preprocessor 
eso CVerrorms classical time or frequency dcrain analysis on 
tne input signal. There are two approaches ccmmonly seen: 
ae Tirst uses bandpass filtering and Yourier analysis in 
be frequency dcmain; the second uses Linear Predictive 
Meemwac (90) to analyze the signal in the tire dorain. 

Feature extraction is the most impcrtant processing 
meemeyion im emy pattern recognition system of which speech 
Pee weenen Systems sCrm a major subset. Both the spectral 
Shape and the time derivative of the spectral envelope are 
measured over the frequency range of interest. Combinations 
@ea “SeqUtUences of these weasurements are used to produce a 


mereco ee acoustic features wnicn include such things as a 
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worc boundary estimate, spectral amplitudes, and formant 
rrequencies. 

Wn west of todey’s systems the flees T three 
functional components have been implemented exclusively or 
pramerily in hardware in order to achieve real time 
processing. In contrast, the classification or decision 
precess is usually implemented in software or a mini or 
Saerocomputer . The classifier uses pattern-matching logic 
powcomparme the features of the utterance with the pre-stored 
reference patterns GnGmeUSeS ae besSv=fitmeralroritam. to 
determine the ccrrect one. It is alse possible to produce a 
“no-decision or RejeCamy RCH NOMeMOfethe reference patterns 
is clese encugh to the utterance. When the classifier makes 
mee SECiISIOn, it uses the number of the best-fit utterance 
mee wecy ae COrrect OULpUtT character String which it then 
femes On a SErial interface to a host computer. 

PrerTre are tyo tyres otf @rrors that can occur in 
meemen recoénition. The first is rejection or the inability 
wo scrrectly classify an utterance. The second, and more 
troublesome, occur when the recognizer substitutes one 
Sewerance fOr another. Rejection errors typically cause few 
problems since most recognizers will inform the user by an 
Ee eble Deep that an utterance was rejected. SuUbDStTITUUIONS 
ome “Mere difficult aera tne better systems usually have 
P@eerne tion eaelgorithms designed to reject rather than gvess 


weeoweeritomenle “words. The better quality systens, such as 
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the Threshold Technolcgy 6@@ and 6&¢, have error rates that 
are quite eccerptable. In the imagery interpretation 
Gzperiment, for example, recognition accuracy was 97% if 
Only SUbSTIUURIOnS Were cOunted as errors. Even if rejects 
Sere ibeluded imetwhe error ceunts, reccgnition was still 
better than 95%. These figures compare quite favorebly with 
Peculvc OCtenmed trom manual typing. Results of other 
evaluations héeve been rovehly the same. We cén Say with 
femme Gerteintg, when, tnat lirited vocatulery, speaker- 
@eEpemdent Voice input systems now represent stable and 
Mature technology that can be usec in a nurber OF 


applications, and certainly in the CNCC. 


oS VNCLGe OUTPUT 
ei Wypical Applications of Voice Output 

Pa COnMmeet tO VOice input technology, the use of 
VOice output is not so well-detined either in terms of the 
Demmmelcey itseis’ cr ct its applications. Manufacturers cf 
Yoaece wnesponse Squipment mave yet to settle on a common 
technclcgical approach and applicaticn outside the telephone 
POePSotre Pas been comparatively limited. In general, there 
are three brcad areas where the use ct speech output devices 
nes proven oeneficial. Beforeé discussing the technology 
Stether, 10 is Worthwnoile to look at these three areas more 


close sy . 
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The fers. v avpiecetlion area involves those 
cperations where the telephone has always been an integral 
part of day-to-day operations. The most obvious example is 
im the f7hone industry itselt where cOmputerized speech has 
(elem used fer some time to perform ‘functions that have 
treaitionally been done by human operators (e.g. The number 
you have dialed is not in service’, etc.) Voice response has 
mese bMen used in ée nurber of private automated switching 
eySmwems such as mc | EXECUNET, a Oa nc Of MCI 
meliccommunicaetions Corporation which links e@ number of 
Cogmpitmomics €él Speechmakers to the Universal Switched 
Webwork of Danray, Inc., a division of Northern Telecom 
[Ret. 14]. AmoOther type ot telephone application has been 
Meenawplementaticr cf phone order-entry systems. The Ford 
forcs CGm@eagy S Dinect Order Entry Syster (LDOFS), for 
example, has teen in cperation since the mid-i97@s and has 
meme VeCTyewehl—=received by *ord and Lincoln-Mercury dealers. 
Pmevauseapplbicaticen, the cemputerized speech output prompts 
mae deoler Whrowsh the parts ordering process. Velo tr 
penerits have been instant stock status, greater accuracy, 
Seeerewea StOCe turn, and ... overall iprproved customer 
cervice. [Ref. 15] 

Myre second broad application area is the use of 
Gomeu@tverized speech to provide informaticn traditionally 
Sue irem by a HYimen speaker. An example is en experimental 


system develorjed for the National Aeronautics and Space 
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Sadministration (NASA) designed to test the concept of 
auipomatic approach callouts. This system (called SYNCALL) 
mood symebesizved speech to generate the aperiodic voice 
waperts eiven a commercial airline pilot by others in the 
Cockpit crew during the approach and landing phases of 
flight [Ref. 1€: pp. 4-9]. Althoveh SYNCALL was developed 
ee Gb eCzperimental system, it did result in improvements in 
fligkt performénce and was generally favored by pilots over 
wee tredatbonal cadlouts by the cockpit crew. Irproverents 
Were moOSt marked for approaches with high manual and visual 
Bomkboee [Rer. 16: p. 78}. Variations of this idea have 
used speech to replace outputs usually generated by analog 
Or adigital readout devices, such as altimreters and fvel 
gauges. 

ihe Wadird major applicaticn area of voice output is 
as a replacement for traditional alarm devices such as 
ligaots, bells, or sirens. The principal advantage of speech 
woe in this contert is that it conveys considerably more 
bafOormewviorn then the other alarm types. If, for exarple, a 
Cempeeer Operatcr nears a bell sound on the computer 
@ooee le, be first mest go to the console and read the typed 
O@Mmpmt MWetrore identifying the problem and deciding on 
corrective acticn. On tne ctner nandcd, if ne hears the 
phrese printer number two is jemmed , he can immediately go 
mer wae @eyice in question and correct the yroblem. cis, 


Oroweemrse, this eprlication erea thet we are rost interested 
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eee te CNCC. As we discussed in Section III, the CNCC 
Contreller is very busy and quite often away frem the Logger 
Meretype. heplacing, or supplementing, the more important 
Legger TTY messages with veice response might permit him to 
@@ Gis job faster and more effectively and save steps in the 
Brocess. 

pier MeaScwesiomeof speech recognition, we bPriefly 
meotioned voice output asa teedback mechanism in a voice 
input system. lS might spe «eomsidered an auxiliary 
@e,aicatiom of speech output since its fundemental raison 
d°etre is to close the interactive lcop with the vcice input 
user. An opératocr may be some distance from the speech 
recognizer that is processing »is utterances. It is 
inconvenient to heve him walk to ea TTY to receive feedback 
Sa DiS inpwt conmands. NOroce Tesmonse is a4 lo gasca Ll 


candidate Yor the *eedbdack mechanism. In the current thesis 


> 


GS vwame toiwusing primarily on the eléerting capability of 


in 


Beeen CUmput although there is limited use cf aural 
feeqbeack in the CHCC model. The CNCC computer provides only 
Bema iedgs seeCad@beck as it stards tcday, making the design of a 
tTOtal closea-loop voice input/output system impractical. 
Thais subject deserves a good deal more study and would seem 
[Ome a 200d approech to use in the CNCC replacement system. 
The Bechnology is clearly here; it remains only to apply the 
Veeemotcme to tne design cf a voice-controlled tran-mreachine 


COmmmmacation interfece. 
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Coeur lem Of Voice Output Technology 
While development of speech recognition systems was 
not pessible betiere the introduction of inexpensive 
electronic computing equipment, artificial speech systems 
heve fren deeper historical roots. As early as. the 
Renaissance, man began to look for ways to simulate the 
a€tions Or t@e compiex vocal m@€chanism by mechanical 
Soatrevaences, many of Which were quite clever in design 
[Ref. 17: p.20&]. Feirly elaborate speaking machines were 
@emseruetea by scientific ploneers like Kratzenstein, Von 
KFerplen, énd Sir Charles Wheatstone (no doubt more famous 
for the Wneetstone Bridge}. These early mechanical efforts 
usuéelly involvea the use of oellows thet genereted varying 
oer CUTPENtS tarowgh 2 vibrating reed. 
AS a doy, Alexancér Granam Fell and his brother 

Ott a Speating automaton by making a cast from a buran 
ssull and molding the vocal parts in gutta-percha. One can 
Meee “Speculate on the part that this youthful endeavor 
played in Beil’s later wera cn the telephone. As described 
by Planagan (Ref. 17: pp. 286-207], 

See bs, tOMeue, palate, teeth, pharynx, and velum were 

represented. The 1ipS mere a fremework of @~lre covered 

Weve SUProer Which hed peen stuffed with cotton bailing. 

Rutcer cheeks enclcsed the mouth cavity, and tne tongue 

was Simulated dy a rubdcer Skir end Stuffed with batting. 

Mee Parts were activated by levers controlled from a 

Meyooerd. & laryex tex’ was ccnstructed of tin and had 

@ fPlexzivle tube for a windpipe. A vocal cord orifice 


Beceicee Gy stretching a slotted rubber sheet over tin 
SUpporvs . 





Met ian ell it wes apparently quite a creation that could 
utter vowels, nasals, and a few simple phrases. 

Imterest in mechanical analcogs of the human vocal 
a7 > vem contiaued into Lhe twemtieth “century. cashes 
eepreeches tenmded tc depart from the human physical 
Sepresenteation and turnec instead to devices such as tuning 
SGmes af@ organ pipec. The evclution oe electrical 
technology accelereted the move away fror physical models in 
See ArFEection of systems that generated sounds by electrical 
tuning of amplitude and BVequency.. H. W. Dudley 
deronstrated his ¥Yocoder or talking machine” at the 1939 
world’s Fair. In the 195@s and €@s, accustic engineers made 
Somcidem@acic improvements in analyzing and representing the 
human vocal tract. Advances in ccmputer technolcgy and the 
Seemp reductions in cost of electronic components have 
@avanced the state of the art te the point where reasonable 
Quality synthetic speech is available et relatively mcderate 
Cosa. 

oe WesSic functional components of a computer voice 
response system are shown in Figure 19. The rachine is 
Pemumree Be Speak a phrase typically expressed in spoken or 
ieeed Saelish text. The synthesis program must somehow 
Cremetve tae Original text into a digital sStrear that 
Peemesemts the desired output including, if possible, the 
eeeer duration, intensity, and boi eect 1 on i (0) Je the 


imeem oed COntTExXt. This stream 1s then input to a digital 
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feomre 17. BSlcck# Diagrar of Comruter Voice Response syster 
(From Reference 17) 
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speech synthesizer which speaks the message either over a 
telephone or thrceugh a local speaker. While the functional 
boxes ere similar for all voice response systems, there is 
Considerable difference in the way that the synthetic speech 
iS produced. 

there are three Das ic approaches to speech 
Sy@thesis; twmey are distimguished largely by the storage 
Gapecivp meeded for the vocabulary and by the complexity of 
the CONT rel Pees 101 generating “the speech. These 
techniques are: adeptive differential pulse-code modulation 
(ADPCM), formant synthesis, and text (or rhonemre) synthesis 
Weee- 17: pp. S-6]. #Their respective data rates and 
mermpeavaet in mequirements are compared in Figure 11. 


Peeci is whe simplest technique. Mie uses a 


vecabulary of nuran-spoken words whose waveforms are 
cigitelly ccdec. Tre methcd requires tradeotfs tetween 
Signal GQ Vied i t¥s, Storage capacity required [Ote one 


Mccapuitemy, and the simplicity of the message-generating 
BROLT|aAM . Bor message essemoly, the synthesizer retrieves 
the digitelly codaea words from a disk storege device and 
apolies them tc an ADPCM decoder which preduces the analcg 
Cleese Sapm@ai. With this téchnique there is no way to 
Womewmormeeyproase@d,y, that is, a0 control of vocal pitch or 
Memetne Of Voce! resoOnences iS posSicle. For this reason, 
Pre emeet dpprepriate applications are those with minimal 


Pessvpliity for semantic smbisguity. ADPCM has been used to 


71 





CODING 


a 


ADPCM 


PORMAMT 
SEN Tae 1S 


Pons 2 
5 Pw Tees [5 


Brevre ii. 


DATA RATE 


cOK 


Se? 


=~) 
() 


palit s/s 


bits/s 


bits/s 


AMT Ot STOR DD SP ERCH 
DNS 1 Tiieeeehe BAS 


1 minute 


OPiiL Ewes 


£40 minutes 


Pata @eeeS 2Or Jiprerent synthesis Techniques 
(from Reference 17) 





Bemeraue VOICE output in applications such as equipment 
esserbly, numeric readout, @€nd stock market quotations (Ref. 
oe i pe 6] . 

The sé€cond voice response technique, formant 
syethesisc, uses a method in which a word library (again 
initially spoken by e numar) is analyzed and stored as the 
Pe@e VerTietions of wocal-tract resonances or formants. 
These frequency veriaticns are computer anelyzed and used to 
Gene a «(daeital filwer whose input excitation is derived 
from pregrammed rules fcr voice pitch and scund amplitudes. 
FOrmant synthesis has been used for the sare application 
enees as APPCM. 

The third, and in some ways the most advanced, voice 
respense technique is called TEXT synthesis or, 
alternatively, phoneme synthesis. It generates speech from 
® Gata ifpWt operating at a typewriter rate, i.e. abceut 75 
bits/sec. Systems that use this technique generate the 
Porce @Utput entirely ‘rem stored rules and dictionaries. 
roemege symthesis, thus, makes Dosst ole ne da mec 
Comwmemenm of typed Hnieclish text tc synthetic speech. This 
PPemrorlity does not come without cost. Text synthesis 
systems are invariably scftware-intensive. Reutines are 
required to convert the text strings to phoneres and apply 
prosodic rules to make the speech sound ‘normal. Such 
sottwar® is quite complex and usually can only be written by 


somecne whe combines software development skills with formal 





eeewmene it’ Linguistics, a combination not frequently found. 
In spite of this imposing software requirement, phoneme 
Sameemesis is prebebly the best fer ccmputer applications 
Since it provides maximun flexibility and has a data rate 
that 1s cempatiole with standard computer peripherals. The 
@emerce Wwsed tO rub the CNCC model, the VOTRAX ML-I, uses the 
pecmeme Symtnesis apprceach. The *ield is quite volatile, 
nowever, and new ana less expensive products are being 
intrceduced all the time. Improvements in digital recording 
end compression techniygues and conparablée advances in the 
parameterized waveform (formant synthesis) method may mean 
thet text syntthesis does not represent the best long-term 
eevermative {Ref. i8: p. 74]. 

POice output is evailable in three forms: chips, 
opoards, and terminals. Chips and boards are inexpensive and 
Seir-—comtaimean (i.e. the speech waveforms are locally 
stored) thus tney don’t normally rely on a host mainframe. 
Trey ere Per ly inflexrvole, however, and require 
Semerveaeregle Sxpertise to integrate into a workable system. 
Peaces Fer Voice’ cutput terminals Stert around $598 and go 
as high as $52,90@@; most are priced under $12,¢¢@. These 
Mewmrrals are designed to connect directly to computers via 
am@ fo-2e2 or 22@mA serial loop interface and usually rely on 
external memory ‘tor vocebulary storage. Most of the 
VGRGioais use pnonere synthesis to generate the output 


Seecen. because cf their ease of use and greater cverall 
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Capabillivies, voice Verma Ss would seer “te de more 


peemrepriate for the CNCC application than chirs or boards. 


‘Go 





¥V. THE CNCC MOTEL 


&. DESIGN GOALS AND CCNSIDERATIONS 
The CNCC model is a discrete event simulation that 
moO@els the external behavior of the COINS Network Control 
Cemeeer COmputer. I[t does nct attempt to mcdel the internal 
operations cf the CNCC, nor to simulate the flow of messages 
amd’ pacxets across the CCINS II network. fasted. =1¢ 
focvses on medeling the system from the point-of-view of the 
feewore Comtrolier, i.@. it simulates the output behavior of 
mie Veeeer Tli and the input and output capabilities of the 
Surmary TTY. Design and implementation were driven by the 
Sci lémime six design gceals. 
®. 9¥8e model should serve as a realistic simulation of 
the external Gemavior of Swe FCNCC Computer ~and 
Oopérating system. 
Peer Seed Trnovide a yehbicle to demonstrate the feasi- 


eilauy of wsSinmg vcice input and cutput technolegy as 
ae toVeeral part of the system. 


S. It should be able to serve as a voice technology 
experimental testoded. mt sist Oe sarle tO Eun in 
Several different operating modes and to measure 
Variations ino US eG response tires and overall 
performance under tnese different operating 
environments. 

=. tee model must be implemented on a computer system 


mie beraccessitle tc the COINS PMO. 


(@) 


It shouvula be modularly designed and independent, as 
fer @s ocsslble, of any specific voice input cr output 
nardware. 
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ee “me mwo@el should be extensible and easy to modify so 
17 car serve as a long term experimental testbed for 
the CNCC. 


The most important ot these objectives is that the model 
Meee 6 eo LIStHe )6€6pCcrtreyal )6€6ccf «the GNCC environment. It is 
essential that the model behave in a manner censistent with 
the way that the real CNCC ccmputer acts. Fer example, the 
medel must drive two independent terminal aevices, one for 
aoe 3 yeeeer TTY and ome fer the Summary TTY. Processing of 
cperator commands should be identical to the way that the 
CNCC nandlés command input, i.e. the respenses and prompts 
Meet OE tee same as on the real CNCC. laerly “Comereve 
Sumscet cf the ChCC command language must te supperted, te 
oem Wee opePater to perform the Wetwork Comune er 
fFunctiomws through the model. Network events should occur in 
Me model in the seme wey that they do in real life. 
were = aid Sfrors in Metwork corponents are, of course, 
Set weDErMinistic. Tme medel, therefore, Should use 
Seems cic metpoecs to generate the network events. If 
Matvones of events were predictaole, it world be impossible 
oe otsecumt tne effects ot learning when running the model. 
Se B@ould be considerably more difficult to evaluate response 
Dime Chenees under the various operating moaes. 

Samce whe mecacel°s main purpose is te explcre the 
Moeeemeeerty of Voice technology, it must be capable of 
meee wen OUut srcm a speech recognizer and sending cutput 


me a) 9SPeech S¥nvhesizer. Speech input is, as noted in the 


o 





previous section, transparent to the host computer since the 
recegnizer appears like a terminal device en an RS-232 
met tal interface; voice output, on the other hand, is not 
Beariy sc trensparent. One cf the key software routines in 
the model is a generalizea phoneme output procedure that 
Mewerecces With the VOTRAX MI-I synthesizer. The routine is 
Peewee -aravyen so it should be reasonably easy to frodify it 
for cther pneneme synthesizers. In additicn to the veice 
Suyput routine, we aeaeTet and implemented a general- 
purpcse text-tce-speech prcgram that allows a user to create, 
Ssore, and wodify phoneme strings from directly from English 
Meat input. 

Ore possible use of the model is to try to quantify 
meerowmemeolsS attritutec to the use of voice input or output. 
mee @c enrs, it shculd have the ebility to measure user 
Peepouse times in at least three modes: manually typed input 
See printed Logger TIY cutput; voice input end printed 
Dopeer OUtLFUL; and voice input and output. By exarnining the 
results of these resporse time measurements, it may be 
Pessiole’ to Ware an assessment of response tire asa 
Eenction of input or cutput mode. 

10 have real long term value, the model Should be able 
to be used ety COINS PNO and CNCC personnel to investigate 
"Gece Wechnology design tradeoffs. For example, we may want 


Dae Woegmewe at the eEfTects of different input vocabularies or 


elternate phrasing of ovtput messages. The model shold 


We 





mrcvide twee framework for studies of this mature. For this 
reascn, the mcdel was implemented on a PDP-11/78 computer 
running the UNIX operating system. It should be easy to 
transport to one cf the CCINS Terminal Access Systems. 

Modular design and device independence carry with them 
poe @istinct ccnnotations cf motherhocd and apple pie. 
merertheless, we didi) make a real effort to embody these 
DPeemeres in the CNCC model. The program is made up of a 
member ef Small, simelie-function rowtines and, except for 
pepe YOPRAX-dependent ccde, avoids ties to specific physical 
mevices. Time UNIX 1/0 system, which views all devices as 
files, nelps maintain tnis generality. To-intverface to 
emovhner WOlce output device would require only that a new 
meaew driver be written. The message definition and 
peoneme retrieval logic can remain intact. 

The design of the model should permit extensions and 
modificaticns ta be mede without changes tc the basic 
Semewene Structure. TO ensvre this, most of the imrportent 
mpocedures are Bfie OT tUabdige-ariven. Parameters and 
Beysical device assignments can be changed at run time. 
maa@ige few cOmmends Or events requires only adding an entry 
POWbae apreropriate tetile and writing a handler fcr the new 
Beeeeo. Fae entire ctrograr is Written in DEC FORTRAN-IV PLUS 
es medified for UNIX oy CULC Inc. This combines the 
edvanteges of writing in a commonly-knowr language with the 


mel iGe vo Wse mamy of tne basic features of UNIX. 


fin. 





EB. EARDWARE AND SOFTWARE ENVIRONMENT 

The CNCC model operates on a Direital Fquipment 
Herporation (TEC) PDP-11/7€ computer running the UNIX 
Sperating system. The system on which the rodel was 
developed, in the NFS C3 laboratory, uses a version of UNIX 
Suprerted by Eclt Peranek and Newman (BBN). BBN UNIX is a 
mperi@ of the sixth and seventh editions of the standard 
mestern Electric UNIX The wodel consists of two cooperating 
peocesses both coded in CULC FORTRAN-IV PLUS '(F4P). F4P is 
a superset of ANSI standard FORTRAN that provides access to 
the UNIX I/O system ard to many cf the standard system calls 
iket. 23J. Since portability to non-UNIX computer systems 
mes BG® @0 Gxplicit design goal, we felt free to use special 
24P features like byte variables and UNIX system «calls. 
the medel shevld be easy tec install on systems running 
Wetients of UNIX other than the BEN version. 

ln waded aiticn wo the PDP-11/72, the hardware Suite 
femeee> «Of « €6twO6U TL erminals©6©6©ccomnected to the mainframe via 
Ser@al 8S-2cS2C interfaces. In the WPS configuration, we 
YWsed two AIMS CRT terminals maaufactured by Leer Siegler 
ice 2887 Wermiael that looks like a teletype to UNIX, 
Eyem@er Herdcopy or CAT, would work just as well. Associated 
mae the ocummamy TTY is a Threshold Technology T-6¢@ Speech 
Recognizer used for input of voice commends. When operating 
Maret ce Tiruy, mode, the operator speaks short conrmand 
Dhrreses to the T-€290 whick converts them to strings of ASCII 


ss 
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Characters acceptable tc the medel’s command interpreter. A 
Meets Of the operator utterances along with the associated 
Character strings is yiven in APPENDIX C. The line to the 
Logger TTY terminates in a VOTRAX ML-I Audio Response 
Pee ueme. This device normally passes on all characters to an 


DMS cennected to its business equipmert pert. When the 


Meee met ects Special control characters in the data stream, 
Mm GGWerprets the characters that follow as a series of 
phoneme codes rmeking up @ voice output message. The voice 
OuUput messages used by the model are summarized in APPENDIX 
yp. A DlocK diegram of the hardware configuration is shown 
po Pigure ic. 

The todel’s scftware comprises two F4P programs, named 
meee @na Tiyin. In addition, a voice output editor (mi1) is 
mSse€q to generate the phoneme strings fcr the synthesizer. 
when all programs are considerea, the model involves 
eapprcximately <8@,¢g¢@ executable FORTRAN source language 
staterents. Code and supporting files occupy over 1€090 
blocks of R3P@6 disk storage (abcut ECOK bytes). The source 
ao0re Pox all software is the property of the U.S. 
Perviecrnment. anyore interested in pcssible use of the 
paomems Shovrld contect the author. Cperating instructions 
metre Prceranm can be ftovrnd in the ‘file cnec.hlp. This 


Pate is include as APPENDIA A to this thesis. 
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C. SUNCTIONAL DESIGN 

BPerre i2 can serve as a useftl Starting point for 
discussing tne functional design ot the CNCC rodel. After 
initializing its internal tables and parameters, the model 
mews FOr eCltner an internal event to occur or aiuser 
COfMmard to be entered or tne Summary TTY. As events occur, 
they morrally generate messages on the Logger TTY. Certain 
messages are acccmpanied ty an auditle beep that serves as 
om cperetor clarr. If voice output mode is enabled these 
ewents will produce a voice output message in addition to 
the primved OUTPUT. sore events, typically automatic 
SumGary reports, are printed on the Summery TTY. The 
meewere Comtroller can also influence events by typing 
cormanas (or speaking ther) at the Summary TTY. These 
cOmmanas are often in response to previous Logger messages 
(ror example, the command to reload an IMP following an IMP 
aqowr or IMP trap Logger message). Figure 13 lists the 
Deeermelly generated events supported by tne model. The 
Network Controller coOmmendsS that are supported in the 
$Bitial release ot the model are shown in Figure 14. 

Bae iest Sme@wn in figure 15 accounts for the most 
Teoquemely occurring network events. For simplicity, the 
progremn. assumes Thet all component failures are 
Smeenmemtially distributea eccording to some rparareter value 
fcr Mean Time Zetween Failures (MTEF,. In the runs of the 


[ome e COM@ucted at NPS, we used a MTBF for both IMPs and 
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Memes c#*cne day (86,422 seconds). We also assumed that 
sore types of failures are more likely than others. Frror 
conditicns on a line, fcr instance, are much more probable 
than the line going down. The program uses a probability 
distribution that accounts tor the relative likelihood of 
dizferent events along with exponentially distrinuted random 
numoers to select and schedule the next event for each 
component. mee pemamecer for the exponential distribution 
meee oe Medizted by the user, at rtn time, without changing 
omy proerem cede; the relative likelinood distribution is 
stored ic an internel table, however, and can ve changed 
Gy Dy a presram reccmpilation and link edit. 

As can o€ seen from Figure 14, the model does not 
Seoupert “all cP Ahe CNCC commands. those mest commonly used 
me respond to network fellures, however, are all included. 
iwo of the commands are not part of the real CNCC command 
Meper vorre, but were implemrentea for user convenience. The 
QUIT cormand allows the user to gracefully halt the rodel; 
pre Doewe STATUS change ccemmerd allows the user to set the 
owas OF a given Pine to UP, DOWN, or LOOPED. The command 
Meeerereter benaves exactly like the real CNCC. wach 
Ge@mrmermd must be preceded Dy @ guestion mark and the operator 
Deo Orly tc type the characters needed tc uniquely identity 
the command. In response to each command, the system will 
ESener perzOrm cr simulate some action and send an 


appropriate response to either the Summary or the Logger 


E6 








TTY. the program will take one of three possible actions: 
my dettally ypertiorm some Seiweee, Such) aS* printing a 
Gemeetery of files; (2) simulate an action and type an 
Mereciate response, such as simulating a tile deletion; or 
(5S) use some probability distribution, such as a normal 
efsurioution, to determine the completion time tor an action 
that involves scme delay caused by disk ocr comrmunicaticn 
transfers. Sor example, the response to the RELOAD command 
Mill be scheduled using a normal distribution with mean i12 


and standérd deéeviétion 2 seconds. 


D. INTERNAL DESIGN 
1. @@erell Structure 

The CNCC model is composed of two cooperating 
processes toat communicate cver a specially designed 
Paverprocess Communication Facility (IPCF). An overview of 
the mcdel“s precess structure is depicted in Figure 15. The 
@@ilad process (ttyin) performs the single function of 
Secepuime awe Validating coOmmanas Entered by the operator at 
mae Summary TTY. @her a legal command is reccenized, ttyin 
rasses a message to its parent process (cncc) that contains 
all information necessary tc execute (or simulate) the 
commana. Pi merc MOns “OLMer Behan cofrfand input are 
pera Ommed ty the parent. Wiese Vinecludge (Scheduling “and 
SFecuting network eévents; processing operator commands 


eeemeeas DY Ltyiny printing sufmary reports and other output 
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on the Summary TIY; and sending alert messages to the Logger 
TTY, including voice output messages it speech Output mode 
Ms enabled. 

MeeOre discussing the control structures of the two 
processes, it will be useful to describe the operation or 
tee IPCE facility. IPCF was needed vecause the model could 
mot ve implemented in a single process. UNIX provides a 
“sleep system call (Ret. 28] that allows a pregram to sleep 
Por some time interval, i.e. until some network event is to 
me eexecuted. The only condition that will wake the program 
from a sleep is the passage of the time interval or a 
special type ot software interrupt known as a Signal. A 
progrem cemnot simuitaneously sleep and accept TITY input; 
Scemersely; @ rrogram in TTLY inrut State can t be 
rept uepted -zCr a time-out. This situaticn necessitates 
Melt command input and event execution be handled separately 
oy GarpHerent UNIX processes. Tne processes, althouvgh 
Seeratime esynchronously, do ne€d some way to exchange 
Peecemation. The typical way for precesses to communicate 
in UNIX is by use of the pipe mechanism. Use of a pipe 
Meautr]s syncorenization between the cooperating rrocesses, 
Recememeetrocess heS to be ready to read what the other 
Mreces on the <ire. A pipe, therefore, doesn’t really solve 
mre oragine! problem; it cnly changes its form. What we did 


Sas Meplerent a primitive torm of interprocess comrunication 
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mee ali@®ws tme processes tc run asynchroncusly, but still 
Exchange information. 

THE approach that we adopted was to use disk files 
4S mailboxes written by the sending process and read by the 
destination. we used disk files bécevse the UNIX Large Core 
Bueter Wrea (LCBA) facility which allows processes to access 
common main merory was not implemented on the NPS’ UNIX 
system. THe system wes designed strictly ter use by the 
CNCC model and no attempt was made to introduce much 
generality. IPCs is made up of three FORTRAN subroutines: 
SNeMSG, RIMSG, and CHEMSG. SNDMSG writes the message to a 
disk file named XPCr###, woere ### is the Process Identifier 
(PIL) of the destinetion process and then sends a signal to 
mee 68ESULLilmnation€6frocess. Reception of this signal causes 
meer mec tigation process to interrupt its current activity 
(usually sleep) end execute RDMSG which reads the message 
oraemenere AGise Tile. CHAMSC is 4 rovtine that checks fcr and 
reeas eny messages thet might have arrived while the 
@eocl@ation process tad disabled IFCF interrupts, typically 


when performing non-interrupteble activities such as I/0. 


—_ 


PCF provides couble-buftering and is, thus, full-duplex; in 
PaeGouige, heyever, it is typicelly used to ress messages in 
Omty One direction. 

Mie dasic control structure of the parent process 
Gorecy; is Shown in Figure t1é. The progrem is event or 


CcomMand-driven, i.e. it runs only 12 response to network 
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eeeets OT Operator commands. EVCEK and EVCMD both call a 
Single dispatching routine (PERFERm) which calls the 
appropriate hendler for each event. Events that require 
Operator action, e.g. IMP DOWN, are executed like other 
ercetS but after ex€cution 60 on a special table called the 
Percing Action List where they remain until the correct 
SmeretloOr resperse 15 reccgernizea. 

Mie scentrol structure of ttyin (see Figure 17) is 
Even Simpler. The program’s sole function is to accept 
Valid comrands from the Summary TTY énd send an IPCF message 
moe emec )6©6FOT.6PaCHh €UCOMMand that it eccepts. The only thing 
that complicates tiyin is the requirement that input be 
tnvurrered, i.e. chnaracter-bdy-character rather than a Line 
et a time. In normal UNIX TTY 1/0, the system obuffers a 
wees ce 2 Lime and performs certain lecal editing functions 
Such es recognizing special characters as delete-character 
and delete-line tlags. To accurately simulate the Summary 
Te’ interpece, ttyin must exéemine each character as it is 


eed pam, consequently, "must perform its own Local editing 


ct 


.see Figure 1&). To do this, ttyin must operate in what, in 
mle Sear ramce, is called RAW mode, where Gach character is 
passed as typed to tne applicaticn pregram. the handler 
ewes erets ctrl-& as the rubout character and ctrl-u as the 
eemere-lime <Tlag. When processing the arguments of a 
Commend, the hendlers recognize the sentinels and eScape 


Characters as defined in Heterence 3. 
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SS. | BAe Structures 

APOSNDIX 5 contains a complete est Of the 
Peramevers and global data structures used in the rodel. In 
this section, we will examine a few of the ore importent 
ompes since tamiliarity with their contents and access 
methocs is a prerequisite ter an understanding of some of 
ome proprem bogic that we discuss in the next section. For 
Detter scltware management, all the glcbal data structures 
ere aefined external to the exécuteble code itself and are 
pOlmmey tO The routines at compile time by use ot the INCLUDE 
Peemo. This imsvures thet each routine has an identical copy 
ct the COMMON tlocks it needs ana eliminates the hard-to- 
fim@ ovgs thet «rise when this is not the case. This 
VeGanrquesalso makes it easier to change the rrograms, both 
durigg development and later during operational use. 

For much the same reason, we made extensive use ort 
tne FORTRAN FARAMETER statement to define I[/0 units, array 
Sizes, leneths ot array entries, and other program 
Qons tants. toes improves pregram readability and, mere 
Bee omeamyiy, Silmplities the task of software maintenance. 
Cr example, tc increase the number cf events that the model 
Samecnwepert, it is only nécessary tO change the value 
assigned to one of the symbolic constants (MAXEV) and 
Dec@heale the program. In the discussions that follow, we 
Sieme treacuently reter to these parameters both by symbolic 


mare apc byY the constant value that they are assigned in the 





Oerremt Version of the programs. Mee. Clarity, We will 
Peprpesent variable and parameter names in upper case, even 
Mecven taey are in lewer case in the actuel source files. 
The most ifportant data structure in the model is an 
feeeey that hcelds the list of pencing network events. This 
array (ZVENTS) consists of four-byte entries, one for each 
PubUre eVEent. EVENTS is an ordered queue to which events 
mre c@c-a in @eCreasing order of scheduled time. Thus, the 
mere emury On the list is always the next event to be 
Pemeculed. “An E€Vent Entry contains the codé for the event 
(SVCCDE), the event subcode (SUBCOD), and the time (in 
seconds since the start of the program) when the event is to 
pecur (EVTIME). The event codes are the ones listed in 
Figure 12; the event suocoae will normally be the number of 
See oe tyorrk cOmponrent such as an IMP or tine. The 
following entry would trigger a message IMP 2 is down at 
12? seconds arter the start of the program: 
ZVCOLE SUECOL EVTIME 
il ra be 
Associated with the events list are a number of other global 
Veariaples. NEV is the number of events on the list; TENE is 
Mee time “or the next event to cccur, 1.e. the sVTIME for 
Sweetest “entry on EVENTS; and TINE is the time until the 


pezt eWVent, cemputed by subtracting the current time from 


ha 
baf 
= 
tet 
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sore events, tor example the “IMP DOWN” ressage, 
establish conditions that require intervention by the 
Network Controller. Such events are entered on the pending 
actions list (PAL) until the correct Operator respcnse is 
Bobed- When at event is pending, the prograr will repeat 
the associated ouput message every 66 seconds until an 
appropriate cperetcr response its precessed. PAL is an 
Pores red array of fcur-byte entries structured identically 
tc EVENTS. An entry remains cn PAL until some appropriate 
Ceemaser actiom resolves it, at which time an entry is 
Sritten in the model’s log tile (cncc.log) giving the 
BVCCDE, SUBCOLT, and the time in seconds that the entry was 
@n tHe PAT. This tile can then be analyzed to see if 
agifferent operating modes have any effe€ct on operator 
pespemse times. 

There are several data structures used by the voice 
Cutput software. The two that have the most significance to 
bee wsier, since their contents can be changed at run tire, 
mee 98 det@alle! aimreys that asscciate network events with 
mares of files where fphonere strings for voice output 


messages are stored. Pen sme oa eCOntauns tWwOstyle 


“4 


SRETTes WHat aqenote the #£VCCDE and SUBCOD tor the current 
event. These are used as an index into the second, VOMSG, 
beech sCONtainS Expression identifiers used by the voice 
Gillet «6 S@sEWare )«€©6tOUlCUCUloccate)6the d «correct pheneme string for 


Geen messecge. These arrays, built from data in the disk 
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file cncec.voc, can be modified by the user with any of the 
UNIX text @€ditors. Figure 19 shows the contents of the file 
as it existed at NPS during the development ot the model. 
The comments in the file describe the structure and contents 
me =ach Entry. 

There are five tables used to represent the topology 
@ea Status of the CCINS II network. ISTAT uses a singie 
Bees tO GenoOte the status of each IMP in the network using 
mee Stadr@dard status codes descrided in Section III. LSTAT 
werrerms a Sifillar role tor the subnetwork communication 
hives. MODLNS has the same structure as the topology table 
of the same name in the actual CNCC (see Reterence 3 tor a 
mescription). VEIMP and VHNAM contain cone entry fcr each 
mest if tHe network. A VHIMP entry contains the number of 
mee 8. to Which the nost is ccnnected;s an entry in VYHNAM 
oomreoins the meme of the host in ASCII. The network tables 
ome 8hitialized at the start of tne prcgram by reading them 
mreomeaqisk. LST&T and MODDNS are in the file coins.nét; 
poem Fr coins.imp; and VaIMP and VHNAM in coins.hest. 

[were is Only one important data Structure used ody 
Etyio, a table defined locally in the command handler. CMDS 
feomompeble that contains a fPour-byte entry for each operator 
command. Bae tirst byte 1s the number of characters in the 
cOmmend; the remainaer contain the characters in the command 


mBemOonicC. 
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impdare 
linb#?é4 
Linrt# 
linm#sF 
lerc## 
irel## 
linup## 
linlp## 
lindn## 
obhac##F 
trapF?#F 
crasntF 
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imp ## is down 

line ## is down 

line ## is down flus 
line ## is down minus 
lNeene errors Corrected 
inc ## reloaded 

line ## iS up 

line ## is looped 
line ## down 

imp ## bd hst cksum 
imp #4 trap 

inp #*# crash 


ai iChat °cr an entry 


cc mn nare## 


Svedtiecode associated wwlth the tmessece 
Soececimec MS2 Nr MALT HIN this event 

= Senemic file name for this message 

se’ is Component number for this lnstance 

1 file rame is the concat of name & Val (##) 
erommanm G@eMelds ectucl file neme using current 
Malle it subcod. 


eo. Cemtents cf Yeoice Message Contrcl tile 





Oe Saisiic byent Handling Logic 

Cimeerris “ENLirely event-driver in the sense that in 
ome a@bsemee of events to execute, it would sleer forever. 
The medeél adepts é€ very generalized definition cf “event; 
eby stimulus, whetner internal or external, is considered an 
meant wader this definition. At the execution level, the 
eepeme *Gendlers pertorm their functions without regard to 
Seet2er the stimulus came frem inside the prcgram or “from 
the Network Comtroller. Fer purposes of discussion, 
hewever, we can categcrize the events inte three orcad 
sroupings. In the first group ere those events that ere 
Somcamlead automatically, either at initialization time or 
@ering operation; in the sé€cond are the responses to 
Spchator COmmands; anc in the third is a hybrid set of 
events that combines features of the other two. 

The first set consists essentially of those network 
EyvemmtS ifewn in ‘figure 156 with the additicn that certain 
Summary reports, usvelly produced only on demand, are 
Meeeeamenec aea@t the start or the pregrar and later outrut 
Seeem@etacelly et tne proper tire. Pos weet lust ra tes 1. bNWe 
Semerdti@m of themsehneculing trom execttion that imparts a 
Beeen geal o2 séenerality tc tne event handling routines. 
typically this first set of events involves error conditions 
hMiesceme DEetlWwORK Cccmponent, sucn as an IMP, a line, or a 
DOE Ln mer 6tme)|)6Shtart Of «€6Lhe program, an event is scheduled 


monweeden netwonk ccmpcnenrt. When this eévent is later 
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Seeeueed, a mew event is selected aénd scheduled. Both the 
meeersic Cvent and the time for the it to occur are 
Meverminecd by the use c* Mente Carle techniques. The 
Perticular event is selected by comparing random numbers 
srem a umitcrm distributicn against a table that gives the 
@umuleative probeaoility distribution for the events of a 
Component type. There are separate tables tor IMPs and 
lines. The program makes the assumption that the time 
DEmWEED component tailures is distributed exponentially with 
s¢cme mean time between failures (MTBE) as the distribduticn 
Parameter, MTBFsS are aefined separately for IMPs and lines 
ani can te adjusted at runtime by changing the values in the 
mene Cihec.ini. The UNIX system only génerates random 
eeens irom a unitcrm(@,1) distributicn. The program 
Mransforms 2 uniformly distributed random number (y) to an 
exporeotially aistributed number (t) by the forrula 


=e 





MTB 


rx] 


Beeweerrc JUStificetion for this formula is presented by 
comeron (Mer. 21: cp. 15¢-153]. 

Operatcr ccmmanas are executed on demand so there is 
Mmomrmell;, no scneauling tunction to pertorm. Typically, the 
preeram @All respond immediately to cperatcr interrupts by 
Besormiage seme tunction and displaying the results on the 
summery TTY. mer CGetain Steeecs of events, tncse Chat 


Fegquire some time period to conplete, a aifferent strategy 
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Roreamerrec. ihe best way to describe this strategy is te 
consiaer en é€xamrple. 

ASSUME toat you are the network controller and you 
wee vo Serd some local file tc another COINS host, say 
SOLIS. You wovla type or speak the proper command and 
Suppcrting parameters at the Summary TTY. At some time in 
foe Dour yeu Ould expect, to receive notification that the 
lrarsfer was successful (or perhaps unsuccessful). The time 
moO COmpete the transfer is a function of three variables: 
Smee stee of tme file, disk transter time, and communicaticns 
trenmsfer time. The progrem first determines a figure for 
tee size of the tile (in 512-byte dlecxs) using a number 
from @ uniform(12, 100) distribution. Then, using constent 
values Home average @usk transter tire (S¢ ms) and 
communication throughput (58x bieus/ SEC) 3 the program 
Gommmees 2 Vealwe for average traaster tire. This value is 
tee eed 6 eSlCUC thE ChCUmGermlCUCflhlUCUcalCUO real) 6odistribution whose 
Stenderd deviation is afpEroximated as 22% otf the mean. A 
es Onm TremdOmecumber iS then genereted and transformed into 
emo DOrmal distribution using an algorithm described by 
Perdan “etet. 21? pp. 15€-159]. 

Sas Mumeer is then us€d as the time to schedule tae 
Come evicn Of wae event. The ccmpletion event uses the same 
event code as the original operetor command, in this cease 
=VCODE $3; to distinguisn between the two states, the SUBCOD 


field is wede negative. When the event henaler is called, 





[oeretecr> the SUPCOD and, if it is negative, knows that the 


epert has been completed. Lt “awe i then Clypeuy the 
pemhOpriate message. This same logic is used by a number or 
routines in the model, specifically the handlers for the 
DUFP, R2LO4D, SEND, and BDCST commands as well es the event 
handler for the VIRTUAL-HOST-DOWN event. 
me “'ouce Cutput Software 

There are two parts to the voice output software 
BEveewopes «Or the mcedel. These parts correspcnd to the two 
steps involvea in producing synthesized speech. The first 
evep, if the precese is tc convert the desired message tert 
into p&oneme strings that are recognizable dy the 
Sou tumesi eer. The second is the recall and use of these 
Suriees as required ty an application prcegram. Ina direct 
meet TO-sSUeecn syStem these two Steps would be rerged; we 
Oeewer tc think cz them as distinct cperaticns 

Mme rirst stage of “the process is sSuTrported by an 
MeLerective t2xXt—to-speech Lrogram called mlil. This program 
Peeeots s@elish text strings from the terrinal, converts 
eevee tO phoneme sequences, stores the sequences in a disk 
file, ana sé€rds them to the synthesizér to be spoken. The 
Pieanman then allews tne user to iteratively refine the 
Segmewces USine a full-screen text editor. Tne. Srser 6 Can 
Se Meetmvely ttne the output strings ty adding, deleting, or 
Peemerar chboneres or modifying rate and inflection levels. 


Wematliy after a *ew iterations, the result will be tairly 
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intelligiole speech thet can be reproduced by an application 
program. 

The Vemt-—tc—piicneme ccnversion routines were 
6Fisinally developed for tne VAX-11/780 at The Naval 
Undersea Systems Center (NUSC; in Newport, R.1I.; they were 
Commgerted to UNIX F4p by the evthor. The conversion 
@lgcrithm uses a set of rules that maps letters, syllables, 
or words into Strings of phonemes. The particular 
implementation supports the VOTRAX ML-I; the design, 
however, is device-independent, since the text-to-phoneme 
conversion is achieved oy 4 two-stage transformation 
process. Text is first converted to phcneme sequences trom 
the international Phonetic Alphabet (IP&) which is an 
Seeempeca Sbandard of Linguists and phcnologists. <A s€cond 
MeulLigae perroTrTrs the device-dependent conversion into the 
ML-i pheneme codes. It is likely, then, that the pregram 
So imeroew igirly €asm to adapt to other voice output devices. 

Sach cutput message is assigned a unique name of 
Sevem or ifewer cheéereacters. The messages are stored in 
emeeertbe disk files tc facilitate eciting and recall, @.g. a 
meeseeeramea crash is stored in a tile nared “crash.mrl1.. 
PB Litreary cr fORTRAN-callaole subroutines allows application 
PeOerams tO recell amd oUTput these messages on demand. fhe 
medel uses the VCMSG table, descricted earlier, tc select the 


MmeoenwOm The Gesirea message and send it to the synthesizer. 
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me Teme ihiNARY RESULTS 

Our intention in building the CNCC model was primerily 
to create a training anc learning vehicle that might help us 
better urderstand the CNCC application and the pcssitle 
utility of voice technology. We were not looking to develop 
Semcemal rethod fcr quentifying this utility. The only 
Varieble that the program attempts to measure is the time it 
Meees Cr thee NewWerk Ccntroller to respond to certain 
Syoutse £5 we nowed earlier, a formal Evaluation experiment 
was BOL a part of our overall strategy. DesStite Iandisae= we 
aecided to use the model to obtain some idea of the way thet 
Ser Tereat input abd cutput modes atfect orferatcr response 
bimewand productivity. 

The wethodcclogy that we adopted was to run the rodel for 
epErezlmiately twO-— BOGT periogs ico wel nee me Le renin 
SmeeImonmemts. Wwe zvirst ran in a mode where input commands 
Mere Typed Marveally and alert messeges were displayed on the 
Eger “T]?, accompanied by bells that functioned as audible 
aubamnms.. Next, we used the same cutput mede out spoke the 
COmMenas tO 4n automatic speech recognition device rather 
Poe teyoing tnem marvally. finally, We added vcice output 
MO Ueesmcomtbelration by sending the al@€rt ressages to the 
meee ecn SyJMtnesizer to te speken aloud. “While the mcedel 
WES Teenie, the authcr, pleying the role of the Né€twork 
Contreller, wcerked on a parallel task a short distance from 


wee Silwmery ana Logger ITYs. (This perellel task was, 


anaes 





Deer@entvelly, the ~yroarction of this thesis.) Vhen the rodel 
Pe-Uec an alert that called Scr cperator action, he would 
peoewe tHE appropriate command by either typing or speaking 
it, a@epending on the particular operating environment. 

e@e epcsults, in terms of operator response time, are 
Gleeerizged in rigure <@. As the results show, there was a 
dramatic reduction in average response time when we replaced 
typed commands with spoken ones and an adaitional, but less 
S@mermatic, reduction when we added voice outrut. The amount 
Of variadility in response time, as measured oy the standard 
@ewtation, follows ea similéer pattern. 

Perhaps more meaningtul would be scre indicat ton, 
Mmememer SuDjective, of tre effects cf the different medeées cn 
Seeeewreh- task FroOarctivity. In the first scenario, very 


meee Mees «6c COIampilasned = 6cn )|6tme)6parallel 6 6task since the 


Semavor Seemed to be constactiy fTroving tror his text- 


mM 


@eviae Werk Steticn te either the Sumrmery cr the Legger 


4 


Mecca ctuel curation of a typicai interruption was 
petemebiy tairly brief. The exfect, nowever, was usually 
Men More Sisaificant since the time would te long enough to 
Meee tae Cpersatcr tc lose his train cf thkecugnt by tne time 
mourereemed TO the test. In tae second Opereting  wode, 
response times were Seustampiails better, DUS shoe 
ioe remenmt did not cerry oWer aS much to the parallel tesk. 
[ere wes Stiil a need to walk to the Logeer TTY and read 


BEeeectieet Tressecee before issuing the eEroper Reco vecss, 
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OPERATING NUMEER OF RESPONSE TIME 
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Figure 2@. Cperator Response Times (in seconds) 
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cormand. Tice iwene, SITUation, the interruplions seered 
Me newe mecn less cf en ettfect. There was ne need ‘fer the 
GCperetor tO frove awey from the terminél where he was doing 
Mes Utext-editiag. He coule hear the error messages and 
ceula quickly ‘fcrmulete a response, speak the eppropriete 
pOogmecaavye COfemahcd, end return to work. 


Tmese results must certainly be viewed with a cautious 


taj 


eve. irs t Of alt, simce no formal C@xperiment was 
Gom@weveds CVien bre cvuantitative results lack the sclidity 
eeeoemese Peat resvlt from a controlled and replicated 
eweeor eum Tey ¢ermit no interence as to the statistical 
memeeeerecamce ct any cf the variable factcrs. Secondly, tne 
mecoscecwmpe COMmemtTS or parallel-task productivity have to be 
memseaerea WnSUCStTantlated ctoservations rem a pcessitly 
mee SougCce, i.e. the author. Lastly, it iS not certain 
meat J'asver respense time and cetter parallel preductivity 
merece Of Weemselwes, 211 thet importent. The NEetwerk 
See eeeerer  meecs tc respend quickly tc netwerk events, cut 


meee ee Treclily a sudctantive aitrerence between Sf sé@condcs 


Gam seaececcnds? [fs cCveratt CCINS yceeration sieniticantly 
mewroveO Dy Such & resSpcnse-tife reduction? Improved 
pasate maoec CULV ity implies that everally Orer acc 


exeheee vencess will pe increes@d4, but is the inrprovemert 
ameter | «6CULOlC«SLME GAS Y )«6©6CtHhe 60 cOSEL)hOCot the voice technology? 


tions cannot de erswered by merely running @ model 
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Management appraisal befcre a decisicn can ve reached that 
MoeeGe ss technnOoleofy ir the CNCC is or iS not cost-effective. 
Peeeurmaoly, the principal value cf tne CNCC medel is that it 


Sem feeeoy ag USEFUl role in that assessment. 
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Wee VeeenOnGoks TO IMPLEMENTATICN 
eS Ee 


A. GENERAL 

Having cderonstrated beth the BES Ste Ge eliaWal= ~ oq tal(= 
Meer lity wt welice tecanclcgy in the CNCC, we turn new te 
ceeoeescusslG@m Of Ncw this technology wight be installed in 
mee Semictims Network Ccntrel Center. In this section we 
Memewecemetcer both the arrroxinate cost and the sters 
mrecived in implbementaticr. Althcugh we will examine the 
Memits ard shortcofings of existing systems, we stop short 
Ge recommending the hardware cf any specific manufacturer. 
Tae commercial rarket for voice technology is Avery 
MOlatile and the system thet seems the best chcice row mignt 
momeoe JeESt in Six months. If it is decided 10 FO dhead 
Weer evecmmceC INPURSCrE cutput ter the CNCC, a comparisen that 
exctlores the tredeoffs onetween cost and carprenility will be 
necessary tbetore tuying sreéecitic hardware. frresurebly, the 
MeeecGrvss ONS in this section cen serve es ea Ssterting point 
Mor Wais effort. mre ultimete a@cicion on the utility ofr 
eeecrereemclcey wWill,ct course, be cased cn iTS coEt- 
PewecrtivVeness to the COINS project es a whole. 

BMeCe er putwedwc CULDUL are related tut, ir many ways, 
entirely aifferent tecnnologieés that raise different 
Meret ation gquescticns. *we will therefcre adcreses them 


Seweectel;, in this s€cticn. Wemmewiule, ti2rst look et the 
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lrplerentation of Autoratic Speech Recognition, where we 
mere accumuleted a eccd deal cf experience ard where 
Pee lementetion is stréight forward. We then move on tec the 
Furkier area of sreech output where we have much less direct 
experience and where implementaticn SO Ge sn cl wede, With 


a number of different strategies availetle. 


tx 


oo Veo TPP lr 
Peomeco VOLCe Input Requirerents 

Pmemaltation cz ASK devices is se sirple as tc make 
the term implementation sound a little Prevent ronus. elle) 
Meee Ss imdvolved is ccnn€cting the reccenizer tc tne nest 
(in this cese the CNCC) vie an RS-222 interfece, developire 
amd training the vocatulery, and oreceeding to use the 
SYSTEM. Mme vVoOCedulery that we wsed tO test the CNCC model 
Gem Serve as tne basis fer turilding the one tc be used iin 
Bevel! operetionr. Mie mein crodlem is in selecting the 
Meier SeySter 1O use. This process, net surtrisingly, 
Wovoerves the clessicel tredeoff between cost and capability. 

iomeeemcer rence td ‘date  sugzests that a speech 
Meee cer used in tne CNCC <shovld have tne follcwineg 


fda Set of teatures: 


—— “aempiome a wOcabulbamy cf at Least 1¢¢ utterances; 
~—memaele to Ocerate in a moderately noisy environment; 
Seecimm@crt ¢ wireless trarsmissicn capability; and 


-— heve recogrition accuracy of over ¥52. 
e 
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momeveeb vocabulary that we used with the CNCC model 
Scmmeins ahemt EF utteramees. At least thirteen coc” these, 
nowever, would not be used in the CNCC in actual operation. 
Examples include: ‘Step the Model”, “Change directory te 
CCINS and several others that were included more for 
convenience thar Opera Gicnal BOC Ce Stas yo le) Pal 91 fal eae ae ae 
Bpoerment the entire set of CNCC commanas in the model and 
there well Demea NUroper cf Utterances thet we will 
Broewrvealy wert to édd. A 1€¢@—-vutterence vocebulary shonld 
M2 Wrewed as a lLower-rounc; 158 or 2@¢€ right be really more 
mrectice! to ellow fon vocedulery expansion. 

Computer centers always generate scme amount of 
Memememeoera FOiSe. The Danging of printers, the whirr of air 
Goea@re wonmers, ena the opening and closing cf doors and 
@hemcrs are heard routinely in any ALP operatiors erea. It 
le Me@eesscary, theretcre, that tne speech recognition syster 
mee come itechiniqve to filter out the unwented beckground 
memecee, Usually this is acconplished by use of a srecially- 
@es immed weise-cencelling micrcrtkone. 

DE WwenrOinccc OF. Carlier, the wet.ork Controller is 
eeocementiy away rem the Summary TIY censcle. It wovld be 
Siteommeeeat fOr him to réturno to the console to enter voice 
PREVL commands. Lim ewrd Sibemeameliy inccnyvenient, as well 
Me wireere, LTO vse a heedSet with wires trailing back to the 
voice input Mie. View meioamecmeCce’ 1 21LTe reaquirereit, 


Meemewor], 1O0r some trre or radio transmitter to pess 





Seats trem ithe micrephene te the recognizer. This will 
perrit convenient and sefe operation at some distence from 
Gae Summary TTY. 

® vTeaseuembe sbevel ect reccgniticn accuracy is 
eccemura!l for sueeessfiul operational use. To be beneficial, 
Speeem reccenitien shculd reduce tae number cf input errers 
Tiet wovld occur i1f cOmmenas where entered manvally. A 
Perese nc!  Mamerecturers Cl .mederately-priced recognizers 
edvertise recognition rates in the YE&Z range. These figures 
earn te @lsleadime, Rowewer, tecause reccgniticn accuracy is 
Mery Ween ae function of vocetulery design éerd experience of 
Pre users. Warreting bvrechures, therefore, should be looxed 
Sp Freer eer Carerully, the best guarentee cf récognitiecn 
peetsecy, Weenne a Test with the @esired vocatulary. 3ased on 
@mecer ence wien the CNCC model, there shoula be little 
weeoeem ins atVValming an accertable accuracy rate with 
several cz the reccgnizers available tceday. It is when one 
Mee rus wo MeOH for error rates of 1% or less that the slores 
me wee ccost curves beccme very steeno. 

Sewerc!] atthors nave commented on én interesting 
@emeviore! Phenomenon exnitited by recple cteirg intreduced 
eo werree’ inpmwt for thre first time. Hisers, ere Tuch sa more 
Gmeemeeea! “Or  vorce input €rrers than of errors in typing. 
Pé@es "Her. &: pp. 21-22] nected that when he demcenstrated 
Webdours sOt@eare ;reducts at the Néeval Postgraduate Schocl, 


Mec His freuulent type errers at a matter of 


mares Cc’ 


‘ey 
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course . rO@even, Tt. he fece one error while using voice 
input the view ciften expressed was, voice rrepuUL 1S Ie Ss, 
but it’s cbviously not pertect and has a long way to go.” 
While we cboWicusly Want an ASR system te be as accurate as 
vossible, we should guard against adcrpting a double 
Standard between speech and typing. 
2. Fossicle Voice Recognition Systems 
loSerweminingwalternative voice reccgniticn systems 
Seoeereple im the cemmercial market, we focrvsed on what could 
ee Gomsicqercd tne riddle range of voice recuse. 00 
mrcamets(tevghly S&&,@@€ tc $28,@0¢ “or a suitably equipped 
system) end tended to ignore systers at either end of the 
Deace splectrum. Seott I[nStrwwments, <9r example, warkets a 
complete voice reccgnition system (the Vet/é) that runs with 
Prve@er the @rpple I! or the TRS-8¢ for about $ee¢¢ [Ret. 22: 
Dy 20a). THe eviaence so far svggetts that the Scott 
Mecowureer ame the oher leow-priced systems do not reet 
either the vocaodulary <ize requirement or the acceptable 
mecomnrccey Level. &t the high end of the spectrum, Verbex end 
eee LE CtTTric )6€6©66DDCtTH)6h6UPanuracture systems with excellent 
Seourdc 7 retings thet ere designed to hnéndle limited 
eccnnected sreech. These systems retail tor over $€5,2@¢, 
however, @ price that seems much too exvensive for the CNCC 


Serecrr 100. 


OF cre many eemmerciarly sl telall il el Ue Secec 
Boereeeon Systers in the desired Erice rénge, there ére et 
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Feast two that prenise good SEC EOrt O18 tige CNEC 


apelicaticn. these are the Thresheld T-€00, ranvfactrred by 
Pereshold Technology of Delram NeJ. end the Interstate VEM, 
Wmeadvced by Interstate Hlectronics Cerporaticn of Anahein, 
meee Teager fay @ain sore imsight into the rapidity of 
GCiemee in the ¥cice technclcgy market by the fcllewing item. 
When this section was first being drafted, there were three 
systems beat seemed tc have spcetential for tne CNCC 
SmpPication. hem the author phoned the third (unnamed) 
Sc@iaay ster inionmaticn, he was intormed that the compary, 
previously regarded as one ot the leaders ir the tield, was 
powloneen Makiae vVcice recognition eqvipment. 

The Threshold-€909@ was the voice recognizer used with 
mee SONCC model in the NPS laberatcry. It was quite Easy to 
Mee end performed very reliably. Tke T-€99 systern is meade 
Po acme chat ncise-cancellingamicrephene, an ‘Ann 
Bevor taree cherecteér display and operéetor console, e ténve 
Smee UMit, anh analog speech ~refrocessor, an [S1-11 
microcomputer, énd ¢ serial RS-232C compatible input/ovtnut 
interface. Ae i wireless transmitter is availanle for an 
@@ditiore! cherge of about 3725809. The recognizer will 


meeermtieueas Many as 256 discrete utterances of up to two 


try 


Secem@s in duration. A peuse of at least 120 ms. is 


Pemgu i red vetween utterances. tiem LDrocessim= time to 


lies 





Mmeeerecze Gf Utterence depends on the size of the vocabulary 
Sete htt! usually be no more than 2&@ ms. [Ret. 23] 

Im Uemmes ci performance, the T-€@2 is one of the 
eco te tae frfrieid. ane founder and Fresident of the 
Sommemy, Thomas B. Martir, is ene of the pioneers ir 
aceveloping pimaietical Speeches TeECOe NDI tLOn Ssysters. The 
compemy has a@ wide and varied customer base and its preducts 
mane, Deen use@ in aimost every conceiveble application. In 
additien, the 1-6@¢ hes been used tir several intelligence 
end CoOmmend end Control contexts and the experience gained 
rem these omeraticns shculd be transferable tO an 
Seepiicevion such as the CNCC. Alia ievemell the Teg? 
RebmMecenisemature techmology trom) a stable and experienced 
CoOmiany . Tge pPrincipel disadvanteze of the T-E€@@ is its 
Beery hige cost in comparison t0 some cf the other 
Bee@eearzers One the gperket. & fully equipped T-E€3¢2, with the 
Bemeless BY Bmamcmitter, mill cost in the neightorhood of 
Sy CEO . 

The Interstate Voice Resronse Module (VRM), dy 
CORRES t., Gem DES DUTrechased fOr sass dittile es $5,¢8¢¢, 
@epecdiog om the options desired. mye Sconrany does on0t 
Peeenciely ea@vyertise en EM transmitter, but assuming thet 
een metr- aveltatvlie at a -rice ccmparabie to that of the 
Pasescnhcolda wireless system (about $5,900), the tctal cost of 
an Interstate system would be rovghly $19,020, about 20% 


less Maem oer the F-50¢. Tne Interstate Voice terminal 


iss 





(\Voterm), combines & keyooerd, display, 48K bytes of Rendom 
Access Merory (RAM), a Z@@ microprocessor, and a 1@%M—byte 
mroppy-disk drive. The Voterm will recognize up to 182 
speaker-dependent aiserete utterances. An additioral 
feature worth noting is ¢ bdoara (aveilable for about $12820) 
eee BHOVIdes a voice response cepability. The board, which 
Uses WOresxX S SC=21 chin, Supports &@0 werds stored in 
Memory “and the ability to tErogrer about 1¢@@ additional 
words. 

ewibe it vossesses SscOmre attractive and rotentially 
meeeut PTeatures, the VRM dces have at least twe pessible 
arawbacks. Prect, tne 108" word vecabulery ra; leave 
Pasterrcient reem fcr expansicn and might not previde encrgeh 
phreses for the CNCC operation. Second, “there 15 1ittle 
eee emce Wlinir ithe CS cr intelligence cemrmunities with 
the VRM. The evthcr did visit an experirertal effort et 
NASA’sS Ares Research Center that used the Interstate Voterr. 
fewever, his perticuler effort used ¢ very smell vocedulary 
Comly four worcdcs) and thus it is hard to draw a compariscn 
mere une GUC Mwhere a much larger "Wecatulery isc required. 
mre ers Pier Factors laboratory bas vplans to Eurchaseé a 
Mereeneoans Soaring, and thelr experience may aelp us to 
Sveftete the utility of the Interstete VRM in the CNCC. 

en ee cemomulal areas ctf Concérn 
Mest ORT OULT di1SCUSS1OR in CEIS thesis has E€mphasized 


Mmervos live adsmects Cr Autometic Sreech Secognition. There 


ky 








Beepeoeguesetiom thet we believe it to be a technique thet can 
Sees the Network Controller in pertorming his duties rore 
e@pectivelye and ccnveniently. Nonetheless, as with any 
Tecweobosey that intreduces tuncamental changes in operating 
peecedines, sit Ns mot withcvt scme risk. There seem tc be 
two issves that should be cerefully considered before 
proceed ine With installation ot a speech recognition syster. 
wee" discuss each cfethem briefly here, but they cleerly 
Peery’ ivurtimer comsidemathion by the CCINS PMC and (CACC 
ranegemert. 

me Ilmemt qwesiwinen concerns whe use cf voice 
reccgniticn ty mere than cne speaker on each chift. As we 
en Sei Sicticciecdwelssewmhere im this thesis, voice recognition 
Sectem> are speaker-devpendent and must te trained by each 
agian SeRe System. im other words, e€ach Network Controller 
Moaeld theatres the <cystem Mite his voice patterns and then 


mese peatTErns On @ SMéehl cCéSSETtTGs. AT the start of a 


in 
a 
Oo 
73 
mM 
ct 


ete we Ss CeOnareiber would «read tne cassette into the 
Memom Gf the necCgomitionesystem (which takes less then a 
meee) and mould thea be ready to issue voice cormands. 
Piro’ Cemeemmc DPFESUMES, seOWEVEr, thet,oq@dy cne “peretcr over 
Tier MieloAse Tae avolee Suulpregta. If this assumption is 
Pere ec Gee Some alternate prccedture will have to be adoptec. 
men oOser leet ,, if the vocabulary is srall encugh, is to 
foo moemere “bnar One coerater on a single tape. Piyis 


meesGec>h. bas been tried intormally at NPS and appears 


pes 








Preeeerorhs: Depending cn the size ct the vocatulery and the 


ty 


im@enuity used to design it, we might be able to store as 
aey 25 three cevs of voice patterns on a singie tare 
cassette. Tae preblem is act insurmountable; it simply has 
mo “Ge addressed before voice recognition equirment is 
ee@mected and inectalled. 

The secomescomeern svers fror the fact thet otr 
Stuéy Of the CNCC was conducted second-hand at a distance orf 
SoCo mikes. We have ettempted tc digest and wrderstand all 
mee|60O0U CPERaATLIiNg 6p Toceduwres for the ONCC. We are not raive 
Eugen 6UtOlCéiéE Liewe, however, that manuels of operating 
mecceciiaces always accurately mirror the oreraticn that they 
Mier t BO deseribes Fcrwmany cf the situations that arise 
ye Lhe cOnN> lil retwiork,e the “GaCC cperators manval 
Meee cribesS valternative comrses cf acticn. For, @Cxdmere, 
m~mcl youre noccamres can often be initiated fror either the 
SOmmemy T1Y using the CNCC ccmmeand language cr the Master 
mw using a different set of commands. Cur analysis and 
supsequent model heave mane the, perhaps S Dmeiterene 
Sweet on. Of a simzle date Entry voint -- the Surmary TTY. 
meee "RTE thet tnis asstmprion is reascnablée, a4 single 
Meaece Tecose. tion device will be workable. Axéin, it may be 


eso 1ctle GHet the ceonverience offered ty voice reccéeniticn 


go): 


Memeo cevse tre crocedvrés tiremselves to be modifieac so that 
fe acorpetton Of ~a single commend eatry position would he 


mere cel ietic. We raise tnesewquecticn rerely te generete 


tay 





pmommov.amd discussion sO that they end any other questions 
tnat may arise Beeause “of the installation of a voice 


recceniticn system can te resolved as early as pessible. 


& “WOICa OUTPUT 
1. General 
With wolice recognition, there were really nO 
BRP Lementation issues other selection of the best equipment 
at the lcwest cost and the need fcr careful plannine:. Ecr 
fees Sreasen, ewe focused cn reguirements and the different 
Gapabilities cf twc pessible ASR systems. When we turn (tc 
wee ememmut side, we Encounter a different probier. With 
ouepmt, the nequiremerts are simple and can te satisfied 
Water awhole renge of relatively inexpensive preducts. The 
Peereen tibet arises comeerns the way that a voice output 
@ewice is connected to the CNOC. AS we noted earlier, voice 
@Uteut is software-intensive. The fundatrertal Goel Se aq 
Oe t0n, then, becomes one of where to locete this softwere 
Memminirize tne cost and Overall imtact. We are interested 
mortem tee ticuler ccmmercial preducts than in the broeder 
meee OF inplemenotation strategy. 


Mmreerare CHiy twe <2uadame: tai requirements of a4 


in 


Mereee OUtpuUt system in the CncC. Pecels ft Shou 
oeenenc. producing speech cutput frem prestored messages 
that is intelligible to someone who is farilier with the 


Poueuluevccameablary. Second, implerentation chould requir® no 





mer Sscrmware ar @ardware modificaticns te the existing 


CNCC, end preferebly none et all. There is no requirement 
mee 0€686cdGUlU(<Cd SLI MCE 6 OKThCUtO Speech capatility. Nor is there 
ever e need for humen sounding speech. A voice outrut 
eeborm™ ,qgowld be in addition to rather than a replecement for 
Mee Logger TTY. Even if the voice output messeges were not 
Meee cote «€©6j@m@elligitie, the system would still te at least 
pemeeoce eS the 8xisting one. we do, of course, want to get 
fae BES SwsStem wmossdbdple witbin some cost range; it can, 
meeever, be eifective even if woice quality is not the 
highest. Voice response systems are available on the market 
Mee prices as leow.as $400 up to more than $10,242, with most 
under SSead@@e@. meisee Or our understanding of “the CNCC 
Cmeratbion and cur experience witn toe CNCC model, we see no 
reeson to 20 atove the riddle range. 

We will censider three alternative implementation 
fCtretegies. Gece Girier meinly in =the locetion of tke 


moar speech software and is overall flexibility. 


é. 2 he vVceice OUT DUT terminel EUpeLLons as an 
B@Mreeerti2zent peripoaera! device connected directly tc 
WimewreiCC® Gi@wmBULeET. it sericrms nce computing Functicns 
Cer 6 CUCL en CUCM ©60C COO VErsSion Of the phoneme strings to 


ChE Pt speech. 


Pee te Yoree Cutprt @evice “ise stend elone voice 


resrorse C OL Vee baat mOi@ild accept aAaScil=-cecded 





(mere Cter strings from the CNCC and transform ther to 


Momwce OUTTUL messages. 


Cee eee imesiaer is a slave peripheral, as in the first 
Seerraey, DUT 1TS Rost cofmruter is not the CNCC but a 
Bemeral-~rurpese micrcccmputer that weuld hané€le the 
V6mer—-speech trensformation and Cite cOmputing 


ronctLonas. 


In this thesis we are mere concerned with overall 
Sy owsommemohwrecture than with eveluatine specific cormercial 
mrcducts. fn the discussions that fcllcw, however, we have 
POwmdad it mecessary to establish some be€nchrarks so that we 
Se wececTiIve approximate costs to the differert alternatives. 
Wer the tftirst end third alternatives, our béenchrark system 
Us, wae VOTRAX Model SVA synthesizer manufactured by Federal 
Some orgs of Troy Michigan. i7eee oie 2S en igeroved 


4 


version of the ccmrpany’s popular ‘“Tyne-“N-Talk text-to- 
seeecn converter. EQUamerse Mule vounnk SO-8) synthesizer 


Prec Ulmmrece om Nas four times as much memcry and better 


© 


Seeepel Omer speech intlection; it costs abovt $2,902. The 
VGtra. TiL-!, wWhicn we used in the C&S lav, dces not seem te 
mommies SGOOVen Extra capability to warrant its $1¢0,¢@@ price 


lag, at least as far as the CNCC application is ccncernec. 


trf 


Seemee ee secodad Option, our benchmark syster is the SLC-II 


Peveliyeent Ceommunicaticrs Gcrtroller manufactured ey 


}+ 


Peace) Pateways Inc. ot Palo Alto, California. The SLC-II 





oes tome )«6cadtCmme )6C COmputer 6 6wwith€)6vvoice)6cutput6[6uccapabpility 


Oueeev-am;, it sells for about $2,29¢. 


coe oeme TOlce OUTTUt Peripneral 

Bhis design epproech, which is the cne used for the 
pre (eel at NES, views the synthesizer as a black-box 
wnose Sole function is to convert phoneme Strings output by 
Pmemreet cOmeuvter into intelligible speech. figure 21 shows 
the synthesizer connected to the Logger TTY in what is 
referred to as a Shared Channel Contiguration (SSC). 
Sieverretively, if there were e sufficient number cf serieél 
intertaces, the synthesizer and the Logger TTY could be on 
Separete RS=-Z5-2C deta chennels. The burden of scamehow 
geTmeratineg re Hnonere Stmunmes 1S berne by the host 
Computer, in this case the ONCC. We shculd note that the 
SUS Gees He@we tairly good tezt-to-sreech capatrility that, in 
PeGey,@NOuld @ean that it covld translate the ezisting 
foeeer TITY messeees directly to speech. While we recognize 
Beep Ccemoelaty, it does not seem That this alternative is 
wery precticel. For one thing, text-to-speech elgorithms 
gee Oe METtect and, bes@d on our Sxeerience, the results 
Sire Om cmmblme nct be satistactcry. Snglish is net régarded 
as @ very chonetic language ana text-to-sreech syaothesis ty 
rule might cause the word IMP’ to be prencunced “Eye-m—-p- 
or the word LINE to come out as ‘-L-ee-n. To get the 
meeessary control cver oprcnunciation, the phonere strings 


SmOovld @robadly be prepered in edvence. Thus, ealthouvugh we 


V2 











acknowledge nea wine Vemts tC-—Speech conversion asa 
Poorer, are tThimk it more realistic to consider the 
merece case, 1.2. were the ChCC must output the phonere 
serines. 

Meeme 1s clearly a serious drawback to this approach 
Mee Seat it requires software changes in the CNCC. It is 
meme coms recriznge, however, how we might irplement it in 
Geicumee «Way as tO minimize the ameunt of change required. 
Simmer Of wall, mime criginal text-to-vhoneme conversion should 
moe taxwe place on the CNCC. A modified version of the 
COEwerston program developec at NPS could be run on a COINS 
PES te produce the phkcnemre strings. These would be moved to 
wae CNGC end stomea on disk, requiring ebout 12,026 bytes of 
Of-iefe storare. The CNCC program wonld te modified to 
recogmize the sitwations wnen a voicéE€ outrut message was 
pe@wired, read the aprrovuriate thonere string freer disk, and 
OUEPUL it te he symthesizer. As a ercsS eStimete, this 
WemeG require at least S@¢=-1¢2@¢@ worcs ot wain remory for 
ce@e and stwiters. 

Pim Sec test OmestimMete thesscost of tke CNCC 
Seo m@are Changes, cr whether in ‘fact they can even be 
eecemolismed. Eesed on experience, however, it is heard to 
Sebi Liat boe cost mould be much less than $12,272, 
eccumame the work was done under contrect. This meens that 
BEC OGG Gr tnls OCLiOMW would be at least 5$12,¢@@¢ and cculd 


MemeeeoOSCsict ee Ce hieher. wmeovtegtihe only real adventege to 


£25 





[eemwreeGOodco 15 the fact that we cau use ThE TEXt-to-speeca 


eae that we develepead at NPS with only Slight 
moadrT i Cation. Pieomtat Case, DOwever, the host corputer wes 
considerably mcre pewerful and flexible than the Honeywell- 
ore. Bly Ga ali, this alternative is potentially very 
costly and there is scme risk that it can’t even be done at 
aol. It is of interest more for academic than practical 
reasons. 
Oo. soben@ Aone Voice Response Corputer 

This design, depicted in Figure 22, eliminates the 
Pees f0r enmy sormhere modifications to the CNCC computer. 
Weseaees would be sent te Whe Logger TTY as they are 
CUmremeee ye OUR would ectvally be read by a microcomputer such 
pomme Dieitel Fatuweys SIC-1!. This computer would pass on 
the messages to the Logger TTY but would first scan them to 
@etTect those€.that should have an associated audio response 
message. Lope tee) Voemiyoroprocessor wcnlid retrieve and 
speek the appropriate output message. 

Blew obG-li 9 is a micrecprocessor—based communications 
Gomerollier that caf support up to & serial I/o chennels. 
Mt url ti-in speech synthesizer can te usec t0 announce 
events over the unit’s front panel speaker or via the 
bewermomee, It is usually tevght the details of a specific 
eppiicatien by means cf simple commands to its cwn cperating 
@powemeca!led SEPSIN. The profile of tne application is 


Gmemed im battery-cupperted memcry <¢o that, after the 
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mMieure 22. stand Alone Yolce Recponse Computer 





Weiewr al training session, the unit can be considered to be a 
Tieecey tailored fcr a specific application. Alone with 
Mme UeSic UHit and the opé€rating syster, the SLC-II is 
Geeamepee «2«=Wth «6a lh hUVersicn 6 6ct micrcsoft BASIC to extend its 
Serewmore Capepility. The SLCO-IT uses a proprietary bank- 
switching techniaue that allews it tc access up to EOK bytes 
Gr sam [Rer. 24: br. I-1]. 

The SLC-II would te interfaced to the CNCC ccecmputer 
ame the Logger TIY on sSeriel RS-232 or 28 mA channels. 
Serors provides the capability te define KEYS that are AsClI 
[eriees tet the SEC-II looks for on its serial line from 
Me-wae@se. Asscciatvec Mith each KEY is an ACTION. [f a KY 
i eeceowea In the Midst Of the serie! date strean from the 
meee, St Mamediately triggers an ACTION; KEY on triggers 
moerom Oo. ACYTICNS cause particular trcgrems to be executed, 
eeo. toey Gay diel pcphone numbers, speak sentences, étc. 
Meee celia, in tme case cit tne CNOU, the ecticn weuld involve 
speaking a sentence stich as IMP 5 is aown. Sentences are 
cemposed from a pre-~-stored vocabulery. 

[ieee - tl nae Several attractive features. ey 
Semrepea, With @@K bytes “er memcry and all scftwere, it 
GOmrts comes MOTECVEr, 1% TequireS no user softwere as 
me fest vne Gellnition of bis, ACTIONS, and SENTENCES 


wWammeye cad be cone by sOmeone who is not an experienced 


ec 





programmer. [t iS compact, with a built in speaker, and its 


us@r manual is cleer and readable. 

There are twe petential drawbacks. First, the voice 
Becacmlary Uheteis ([rovided is not déetined in the manual, so 
Deere is noweway ef Anewing whether cr net it is adecvuate. 
em MOCauuvary regquireémerts of the CNCC are, to be sure, 
Small, but a mumber cf uwnccmmcn words are used. Pt, acs 
Poigcusot ec UsataienmecesSsary, an extended vocatulary could be 
purchased at some extra cost. Potentially a bigger precbliem 
is the limited number of KEYs and ACTICN’sS, currently set at 
me Seem. “Zor the mcdel at NPS we used cenly 12 basic 
Sentences. Each hed” a number of variants, however, for each 
OF tne aprlicatle network components. We ite Vea Salo 
Verdemts “of the IMP dewn message, for example, one for 
©€ach IMP in the COINS sudnetwork. There is some suggestion 
in the uwuser’s manual] that these limitations misht be deelt 
Wee ee, Powe ree UsSer—Calbabple subroutines which use special 
Mee wc Geeess Called buiers. Euffers seem tc vrevide a way 
Lo” tTormudate an speak Semce vce S dynarically. oe 
COmparatively lcw cest and the ease cf implementaticn 
SUpseou thet these issues should be investigated further. 

a Geemccomonter-—Coatecglled Synthesis 

fees erchitecture, shown ie Paar Te Zo, is 
Sssemgtially an atvemrrtr to combine the advantages of the 
Senet ioe de st2ns while minimizing their <Arawtecxs. ie 


snares with the first the advantage of a cenvenient and 
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Deomenreat Wexrt—tVo-sreech editor and, with the second, the 
edvantage that ao changes to CNCC software are required. 
P@aitiomelly, it is»mere flexible then the second approeéch 
veceuse it makes available the tull capabilities of a 
Sten@ard general-purpcse cperating system. 

Dee system @ould Work in the following way. The 
user wWeuld emplcy a special full-screen editor tc generate 
Me Pheneme Strmaes to drive one of the VOTRAX spéech 
omrumecizers such as the SVA described above. These strirgs 
Weul@ be Stored in stendard CP/M text files. An application 
progmam running under the CP/M disk cperating system weuld 
capture messages sent by the CNCC and pass ther to tke 
beseer TTY. It would alsc scan each message lccking for 
Guasm Gomeitions. Onree tke progrer recognized an alarm 
message, it wceuld retrieve the ccrrespcnding phceneme string 
amd send Tt to the SV#. 

Resoceca Caltor that could te used to suprort this 
Strategy was developed i. TSwenep- si revisStic Reseerch 
Rec eerrres Om edi c Park, Calitornie and 1s marketed uncer 
beer the trade neme SpeechWizard. SpeechWizerd is a full- 
SCmeeneareor 10r ceveloOvping ~hHhorene codes for tne entire 
omy es Cf “YORRAX syntnesizers. it is not a text-to-speech 
peepee OU Tratmer a Speecn editor tnat generates codes for 
S@erea “seccuulary to Be used’ later in application programs. 
Preemeesemse 1 1S similar to the mli @Gditor developed on 


UME Aeeor the CNCC model. It carn fe run on any micreceTputer 


Vo. 





Met SUoports the CP/M operating system, which includes 
Sieatvetiy CvYery Rejor personal computer. Using a special 
SCUNDSPELLING system, the user can generate vcice messeges 
wepeeut learnige about acoustic and articulatory phonetics 
ame witnout dealime vith hexadecimal, cctal, cr ASCII ccdes. 
Record 1 ng TO tae develorer, te. Canol iA. Slmpson, 
SPEcCGEMIiZard hes beer tested with linguistically neive weers 
who have used it tc create intelligitle speech [Ret. 25]. 

Speechwizard is being used at NASA’S Ames Research 
Gemver tO Support a res@€arch effort designed to assist 
merevccume, Mlrets ilfying low-altitude missicns. The system, 
ieee tivee on a SCEL=20 SvEC-tased microcomputer, seems easy 
fomerewona BAe OUT DEL Speech, vroduced by a YOTRAX ML-I, is 
Guear ama intelliegitle. The price for SpeechWizard is $25; 
Ge course to use it reqvires a Microcomputer thet runs the 
Cryr. diam Operating syster. Tne cost of such systers will 
Wermeoue a Barrly well-equipped System can be purcheseéed for 
about oe cee . Mssimeue that “we used the YCTRAX SVA 
Eyareiectzer,tne total cOSt WoUWld be in the $6,20€-$7,920 
range. 

Meptementaticr of this alternative would SUS 
Feevare tke tevelopmert of é& progrem to scen the CNCC 
messages and select the appropriate voice response based cn 
PPeweCntedwtsc. M1lS progrem Can oe Written in @e kigher-level 
lagguaeze and should te relatively Straightforwéerd siace it 


Mor wescesttcatly <« Straight Cherecter string ratch and table 


ee 





lookup. Someone familiar with the CNCC ana CP/M should be 
able tc develop such a prcegram in a week cr less. Althcugh 
Pi omerrs a little mere than the second alternative, this 
Strategy orcvides a mecad deal mere flexibility and 
possibility for enhancement. The number of possible output 
messages has mo practical Limit and the quality of the 
output speech is good. Since SpeechWizerd was developed by 
4 pretessional linguist, it embodies a considerable arount 
oO” linguistic knowledge. The availability of a pregrammable 
ricrocomrputer eeser OrensS Sp Other Spossibilities. The 
BucteCGmeUmer cowld; tor example, retvormat some cf the CENCC 
SUemery Feports to improve their readability. It also could 


Setcormm seme lccal command editing and provide veice 


yO: 


feedbeck to the Network Controller. While we are not really 
eeieeme = «Or |6U6UddlUCUmeyhCUCTCOlCUdimpreve)6€6Ccther€6aspects.)6—hGUc? CNCC 
Operanhion; if the vossivilities for such improvements are 


BECO eo UG 1b wOucld Ss€Q@meshortsighted to ignore ther. 


”. GOroiaee YOIOS INPUT AND OUTPUT 

AbtDoughs yoice sale bee and ORIEL Gearesent Two 
ibpeewenmdgent tecamclogies, it 1s matural tc want to combine 
bigem in @ simele system. In several pléeces in this thesis, 
w© have alluded tc a clcsed-locp system combining speech 
Ree@ene7+sOn and voice response. implementation of such a 
Spore wend «=Nct be easy, particvlarly in view of the 


Sustifieble reluctance to make any modifications to the CNCC 
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software. Nevertheless, a step in that directicr might be 
Dosstele if we could install a system connected to the CNCC 
eet SUppOoTtved both speech input and ovtput and alse allowed 
fOr Wee development of user software to employ them 
gaomclligenmtly. 

She Interstate YVOTERM, which we discvssed above in 
comrpection wi td yen ce IMbutL, wtepevides at least the 
PeoceuyetO }6hCCRStmCt lC6USUGhlhUAmlCUS¥SteEm 6 6UWhich)6 6omight~— hUte 
Geme etred 4s showm in Figure 24. The VOTERM is c&@—-based 
an@weremiaes en bytes of RAM, an i1¢¢M-tyte floppy-disc« 
drive, a keyboerd end display screen, the discrete-utterance 
Voice Recognition Module (VRM), and voice response ‘rom a 
S@eeword wOcebulery thrower e VOTRAX SC-O1 chip, the Seme 
Gm@emmeed Bh wee S¥Asscynthesizer. Base cest for the syster 
moc gem, SG,000 for the VOTERM itself and about 
So .2@@ “cr a wireless transmitter: an expanded mcdel might 
Gcom els meen aS $15,200. Tris if in contrast to a cost of 
Omer >20ypera if we used a T-E@2 speech recognition system 
Croom rer ocomeu cGir—bDeSed VOICE OUtTfuUt device. AS we noted 
Soewens we don t have much direct experience with the 
Memeete oe Prectct and it May come up Shcrt cn some of the 
CNCC requirements. Its 1@@-utterence TE COant on 
Reever, , “scr exemple, may not te adequate fer CNCC use. 
Me ebility to a0 write vser programs in Gena ever 
languages may provide a means to overcome these limitations. 


Ta ligmt of its reletively low cost and in view of the 
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Interstate 
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Meauwle @ elatye 
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Figure 24. Cemtined Voice Input/Output System 
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Pepgetional advantages of combining input and output a single 


mee bars Sitemeeyei Ss certainly worth pursuing further. 
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VII. SUMMARY AND CONCLUSIONS 


Se oove taren severdat dif@erert paths in the preceding 
meer solons, le Seems in Order, therefore, to try to bring 
Wreoe Patns tOeethner and summerize our findings. We first 
aeveloped a cror le oF common Cherdgeveristics OT 
aprlications that have successfully used voice input. ite 
CNCC applicetion scored feirly high when matched against 
Pees emerite. Wa iess formal way, we looked at typical 
Seeeeeations of “Volce output and again ccncluded that the 
CMEC Was a reasone@dle candidate for application of this 
LecmuolOge,. 

Having esteblished the potential usefulness of voice 
peoummomoe, it the @nNCC, we proceeded to develop a computer 
wmode! that would help us evaluate the feasibility of VIO 
Soe at "eae Sane tine, eive a prelirinary indication 4s to 
Gwe comtributicn that it mignt make tec Cae overall 
Sr eeeviwemess Of the Network Controller function. Our 
cenclusions in this area were rather tentative. Wwe showed, 
for example, that use of vcice technology resulted in marked 
ber oVeEment in operatcr response time for cne inicrmal test 
Wet COmecered the different oOpereting environments. This 
sare test seemed tc suggest that the use oe vaice 
Peememergey;, Perticulerly voice output, would increase the 


Eroemenivyruy ot tne Network Controller in Terrormring 





meralile) wasks. Weair, caution must be used in interpreting 


MmECeeeGeCSsyL US Since wtne tests were informal, unstructurea, 
age vunreplicated. The conclusion with regard to parallel- 
Vase prodvctivity, in perticular, must be viewed as tke 
Emeyectire imenessnen sof a single test sudject with no 
Guamtitative measure tc Support it. 

fe ape lic@ebility and feasibilistty established, we turned 
nO Umen cCOmsideration cf hcew best to integrate the tecanclogy 
imt0 Gabe Cxiestime CNCC. Installation of Speech Recognition 
Sguiomenmt was Skiown to ve straight sorward, invelving neo 
charges to the current hérdwere or software configuration. 
Bt least two Manufacturers offer systems that would rcrobably 
meet the neéds of the Network Controller function; selection 
Gey Geee ome TO use involves the classical tradecff between 
Ges tmena Ceapeblifty. The instelletion of a voice input 
emeece mould Mecessitate modification to sore of the current 
CNCC operating prcecedures. As with any change to. an 
Chi Sing Opera tlus Covel no nines t., Care u 1 Clanname is 
essential in order to maximize the tenefits and minimize the 
pemential disruption. 

Mee erTOoveri1 lity ot operational disruption is much lower 
“or voice output. The addition of spcken messases, even if 
ere se peeon Quality is not too high, is unlikely to hamper 
CPEIParrenr . mime rest sePloues problem with volce cutput, at 
Perper ns time, 1S that 10 ré€quires considerably more 


S@SG@Weee Vrem does speech reccgnition. oe mio Lemed tela 


t 
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aearo response cepability without software rodifications to 
tpe ONCC will require that the voice output device be 
Oriven Wy an “external computer, either a self-contained 
Voice Resronse Computer or a general-purpose microcomputer. 
Whichever appreach is chosen, a certain amcunt cf softwere 
Gemgelorpment and maintenance will be recuired. oe 5 oes 
context, sceftware includes anything frem creation end 
Batting of woice messages to develor;mrent of simple programs 
in a higher-level tanguage like BASIC. The key point is 
eet, while we ray ve able to frinirize the amount of 
eortware werk, we cannct eliminate it all. 

In the section on implementation, we included some 
discussioa Of Utne@e cOmparadtiwe costs of the different 
alternatives. This is a difficult eré€a to summerize since, 
wero Fe ExepeClTeC , Wace cost picture is a comparatively 
od one. Poe iSwmeeGcscmpme, nonetheless, to at least 
Soeur od  S@me cOStT bOENdaries as guidelines for decision- 
jose at tne low end, it is possible to irplement a 
Somoeeee WOrce lmput/Output system for as little as $11,9¢¢; 
Seer DOtvemitial CoOnfTiguretions covld cost twice that much, 
Sad stemere dr= a aumber of alternatives in between. Our 
investigation has shown thet the technology can improve 
Seerevirons in tae CACC. The guestion of whether or not the 
Deere vement is “Worth the cost is, quite properly, a 


Mane@eement rather than a technical one. We hore that scre 
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c* the grcundwork dene in thls thesis might make that 


emestiven ‘easier tO answer. 
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Were 1X oA 
OPERATING INSTRUCTIONS FOR CNCC MODEL 


ieeae Instructions assume that the person whe wants t0 
rum the mced@l] hes a werking tamiliarity with both the UNIX 
ele system enc the CNCC operating procedures 45 
cutlined in The CNCC Operators Manual [Reference 3]. Given 
Waese assumptions, Penning tHe model is faite 
wre i2hrsorwar'd . Pie Process invelves three stevs: first, 
wre haat (UOTE. 6 | 6SuUprOort files contain the Correcs 
infcrmation; second, running the medel; and third, analyzing 
wee results. 

There are a number of tiles that the system uses to 
initialize internel parameters and tables. The casual user 
Re Only eerer heme to Make Shanges to one of ther, the file 
cmec...int. mois File comtains 4 number of parameters end 
switches that the rrogreamr uses while operating. The file is 
a) cswoma@ard UNIX text file thet can be modified with any of 
Doe aeexm Yeattors, such as the full-screen RAND editor. Te 
‘le also cOmteirs comments thet essist the user in makirg 
Cl@mages. Tne contents of the file as used at NPS are as 


-Cll Gas: 


46.0,864202.2 ,664040.2,-1,/aev/ttyh 
rtmul = 4E.2  (ieeoumuemes real time for this rus 
mittee =~ "e6420.2 5; mtdof for an imp in secs 
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pmatbprt = c€400.0; mtbft for a line in secs 

works = -1 » VOice output enabled 

Moreen — /dev/ttyh ; device for voice output 

fr] PeGemeter fields contain numbers or ema fac Ger 
StFings and are delimited by commas; the format is fairly 
Begid and ail fields except the last must contain the 
re@vired number of character positions. The first parameter 
CHTMUL) is the multiplication tactor that the model uses to 
¢peed wp real time. In the example, e€ach half hour of 
Progmeam time Would correspond to an entire day of real time. 
ihe serameters 2MPEF and LMTBF represent the MIBF for IMPs 
Gc Weumec respectively. In this case, MTEF for both iS one 
day (86,480 sé€ccnds). The pregram will apply the real time 
DieecalLdoOn factor to these values converting them, in 
this Camm, tc “<0 minuves each. The parameter VOIFLG 
@etemmines whewnper or not voice oOuUtrut messages will be 
us@e. ie Gees Venerpic, the value cf -1 turns on voice 
Tat seomcmmecule itwetne vse@n would change the value to @. 
rewrote herd contains the UNIX path name fcr the device 
Boece t@cenve as the Logeer TTY, in this cease /dev/ttyk. 
SMigcespoewmaes  fleribovility in configuring the system. Note 
PaeeeasS tie System is currently aesigned, the voice output 
Oe tee aaa tmes oeeer TTY rust be tne same terminal. 

Once the user has mede any reqvired changes to cncc.ini, 
oer Ope cere 1s ready to run the model. To co this, he should 
DOG@N Gn Dev the Summary ard Lcgger TIYs with the seme 


USB ip son veto devices. Next, change the working directcry 
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On toth terminals to the directcry where the software for 
the model pecides. (At NPS, this is currently 
just/malarkey/ccins.) Next, on the Logger tty, enter. the 
commend “beck”; this will start up a background process that 
mie. periodically myre out the current date and tire. Then, 
On the Summary TTY, enter the commend “cncc’” and the rodel 
wWidl Start tO execute. The program will first tyre out the 
Gumgbent date aid time on the summary TTY and then will be 
neegey wOwaccept colmamd input. The tollowing commands are 
Gumnenta y Siti De Pitred:: REPORT, PROALCAST MoO PRINT 
Deaton’. PRINT TOTAL, SEND LOCAL-FILE, DUMP, PENAME FILE, 
end PELETE FILE. see the CNCC Qpereétors Manual fcr aetails. 

Pn ete rclon, two Scecial commands have been included 
Teemeeeraeer COnvenlerce. it may ve useful for the operator 
to change the status of one of the lines in the network. To 
mo tmesy 42 cpecial command has teen implemented. The user 
types 71 3} the system responds with “INE STATUS IS°; the 
user then enters either ad’, a “u’ cr an “1° te dencte 
Cae toe ew lime 1s Cithrer down, up, or looped; the system then 
responds “LIN=: ° and the user enters the lire number 


followed by en eScepe character. Here are two exanples: 


Oe Aes IS uP LINE: 2$ 
TIME STATUS 15 100PzHD LINE: 149 


Oy teece 1 Sets coppand 2s used to restore a line that has 
peen marked down by some retwork event. The other speciél 


COMMmand = EroOyides a Way to gracefully stor the- rodel. To 
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halt, type “?q'°; the system will respond “UIT CONFIRM (Y OR 
Mm) emd the user responds approrriately. 

Peewee Tunm@ime the model, the user may wish to exemire 
the resronse times for the run. These results are in a file 
oemeee Crec.to2- Eaen line in the file corresponcs tec cne 
Skee ema, SequlTred Operator action and contains the event 
Gere Enemotneere, end sthe number cf seconds that elapsed 
Det@eemetnewtrme Virat the failure occurred énd the time that 


an applicable operator response was noted. 


Je gaes 








LPS Tx” B 


PARAMETERS AND GLOBAL DATA STRUCTURES 


PARAMETER DEFINITIONS 


yarameter evisz = 2@¢@ ! size of event lst 
parameter mexev = 56 ! mex nr of events 
trearameter leve = 4 even yee even ueen thy 
parameter palsz = 49 ec. Campenarne acts 1ST 
peremeter lree = = lL slieth of Gal entry 
perameter onlines = 14 por, Oe nes )n Re Twork 
parameter nimps = 5 ore ae toe) Ne ywork 
perrameter mhosts = <2 (Sir eot Network Hosts 
WMeraMmeter crdtty = 5 epun. Got OT 20.T oC I Ga ane: 
varameter sumtty = ¢ Pen lee 2c Pe oh imma nr yoo. he 
perameter disk = 1 (dees f/ Od. t number 
parameter bo@til = 2 f unit # for log file 
parameter mail = 14 Eases anterrun, s tera 
parameter mken = Sd Werenerine of per mrs: 
Berdmever Taw we Pigeon Or caw et tyr 91/0 
parameter cock = 1777S% Peles 2 Orecocked tty 
peremeter nvmsg = 12 PRO te VOvce= OU Gell omses 


CLOBAL DATA STRUCTURES (CCMMCN BLOCKS; 


CGmmcom PLOCT =VIN?F contains info on Event list 
coOmmon/evinf/ttre ,tfne,rtrul,rtinv,events,evlp,nev, 
= Stume’, Cl Wee, evecde ,eVvVDTrms,evsupp 


Cide loony Next event 

Time FOR next event 

Real time rultiplier 
averse of rtmul (1/rtrul) 
@rcered list ct events 
Poimeer tO next event to edd 
Dreoreevyents on list 

Ree ota vime 

currert time (rel to stimre) 
@cee Tom current event 
subcode or params fcr Event 


integer ttne 
integer tifne 

neal ftmul 

M@al npinv 

byte events(evicz) 
imvecer ev)lr 
lateger nev 
integer stire 
lnveezer ClLlme 

byte evcode 

byte evprms(4) 
byte subdcod 
equivalence (sutcod,evrrms(1)) 

byte evsupp(Z&) ! supplemertary event information 


ee Se See oe Sed ee ee ee) 
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CCMMON BLOCK NETINEF contains tables and veriebles that 
Popmesent tee current state of the network. 
cemmon/netinf/ istat,lstat,modlns,vhimp,vhram, 
+ l@not,ilrtof 

Bgwe istat(nimes) ! imp status tbl 

byte lstat(nlines) line status table 
byte modlns(4,nlines) network topology tbl 
byte vhimp(ahosts} host-imp mapping tbl 
tyte vhnam(10,nnosts) virtual hest nares 
real imtodf Wie forsean IMF 

reat lmtor MISE tor a line 


Cum C= Sw ewe Gu eo 


GCOMmGH BLOGH PALING contains infc on pending action List 
common/palinf/pél,pap,repeat 


cyte pal(palsz) ! pending actions list 

integer pap ! pMtinter for yal 

logical repeat ! true when we are repeating 
! 


previous events fron pal 


COMMON BLOGE FOCOUT contains info on voice output mses 
commor/vocout/votfleg,logtty,vorur,vorid,vorseg 

begrcal vefhe emue 2a volce Output turned cn 
imgeeer logtity iy it sf Ore POneeEr TTY 

integer vomnr message nr withi3a event 

byte vomid(Z,avmse) code and message number 

byte vomsg(&,nvmrsg) file names where phoneme 

S Uleeese are stored 


ow e*—8 =e—8 ¢- ow 


Clio BEGG ViaTA ccontains phoneme info fcr current 
voice output messéege 
COmmou/vaeata/V¥rhon,intl,rate,nv 


integer*4 vphcn(maxyv) iSpaeremes Tc be CUtPUt 
byte infl(mexv), rate(maxv) ! inflection and rate 
byte aF¥ ' nr of phoneres 


CORmOw PELOGR MISC ccntains gemeral purpose stu? 

common/misc/adate,atime, pa ey fc ee fa , ttrode 

byte adate(12) !' ascii date 

byte atime( &; ' ascii time 

byte lbuf(&2) ' scratch buffer. only use for 
mebeoomorary Storege of data. 

integer sttyfd fee C@eSCr for summ tty 

! 

! 


integer ltryfs fomrerdescr for logger tly 
integer ttmode(Z) Lty modes 


COmepOn BROCE TIYINY ccntains glecal data structure ‘cr 
command handling process 

Ceommoen/ttyimf/ chan,ccode,delchr,aellin,corplt 

limteeer chen mau. ecaoan fer Summ TTY 

Pete eer ccode Mecccwe TOr Current opr cird 
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begicad! gelchr 
logical dellin 
Lee real comrl t 


char was pe wene chr 7 
Char was delete line 
command line completed 


trie 12 
) ‘Emue it 
l twee itr 


COMMON BLOCK IPCF ccntains necessary variables and 
buffers meeded to implement priritive IPCF 
cormrron/ipcet/ msg,rcevmse, a Gerd wewmmse, 20fr.ail 


byte mee(mlen) 
byte revmsg(mlen) 
integer spid 
impee@er dpid 
logical newmrse 
leg@ical nemeil 


send message buffer 
receive message buffer 
! sending PID 
. CES tinatloOmrrelL 
[ tmee ll aewet Sfagas arrived. 
f true when mail ints disabled 


COMMON BPLOGE ENVIR contains info used dy doth parent 


ama Chivid processes in CNCC model. 


eng im det. 


Saved in disk file 


common/envir/parent,ttyin 


ipmméeszer parent 
limbee@er ttyin 
TABLE 


parerwever Remds = 
ayte cmds(4,2cmas) 


OF VALID OPZRATOR 
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cmds(1,j) - nr of chars 

omds(2,3) - 1st ckar of 

Cais (2%) - @md cauar of 

cma@s(4,j; - Srd char of 

@mate (omds(i,i), i=1, 
gate (emas(i,2), i= 1, 
@ate (@mms(i,5), t = lw 
daiwa (cemds(i,4), i=1, 
Mata (emds(ins), i= 1 
@ewe (ome@s(i,6), i=l, 
@etea (Crs (i,7), i=l, 
data (cmde(i,8), i =1, 
@rape (Omes(i,g), i=l, 
Gemma (ema@s(i,10), i =1, 


pie 20r Chee process 
iS pideet OFT eRIN ~precess 


COMMANDS 


(TRG eereopr corrends 
! tacle of legal commands 


Mie ce CMmmania 
jth command 
Jt commend 
jtn command 


Mine Be C07 
@)/1, P°,2,2/ 
m/1, 5 ,o,a/ 
We. >, Uae / 
“A)/3, 7D, ELS 
A\/3 ROE ON / 
mo. 8 4 fly ck / 
Ave, RR, % = Ps 
fei. Ce, 8/ 

mp yi, L ,0,0/ 
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= 
ae) 


CMOnNN NIP AMF & 


V 


UTTERA 


ZERO 
ONE 
TWC 
THRE & 
FOUR 
FIVE 
SIX 
SEVEN 
eG Et 
NINE 


OICE 


NCE 


APPENDIX C 


INPUT VOCABULARY 


S Oren HODEL 


CUICKP 
STATUS 


RINT 
De? 


REPORT 


TERUFUT SUMMARY 
DAYSUMM 


DUMP 
lag 
DOM? 
DUMP 
MP 
Deer 


bor 
Ge 2 
Mee 
IMP4 
Lr S 
ieaus 


RELOAD 
RELOAD 
RELOAD 
RSLCAD 
RELOAD 
RZLOAD 
CLAAR 
CLEAP 
CLEA 
CLEAR 
CLEAR 
CLZAR 


IMP1 
PZ 
WPS 
ears, 
TMPS 
PS 
CRASH 
CRASE 
CRASH 
CRASH 
CRASE 
CRASE 


aca 
IP 2 
ies 
IMF4 
ghee 
er © 


CUTPUT STRING 


OMIM OK CAME Y 


me y 

PREPC: 
CREEPS: 
CHEPT 
THEPL: 


?DUIMPSYS@8$ 415 
MoU IMPSTSeb sacs 
Pete loee oUe> 
?DUIMPSYS 885049 
7DUIMPSYS8&ESZSS 
?DUIMPSYS&SESOES 


(Rei MPSTSsgos@l> 
Cie PSYSooSU2s 
Cuabie slog yy. > 
Peg libs 1S995C4> 
PEBEAMPSYSY9SU5$ 
Bae ii rotogss0o> 
7BCLEARCRASHS1> 
ECL GARCRASES cs 
2BCLEARCRASHSS$ 
TECIWARCRASHS49 
meemenRCRASESSS 
7BCLEARCRASHSE$ 


Mane UP LINE I 
MARK UP LINEZ 
Mame UP LINES 
MARE UP LINEE 
MARE UP LINE? 
Pare UP LINES 
wnmes UP LIND 14 


CORRECT? CKSUM IMP1 
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2?LU1$ 
Cauzs 
il Upees 
2LUSS 
?LU7§¢ 
?LUSS 
?7LU145 
?BHACMEM IMP1$1$ 





HP oP HP ope op 
YN CoP ANH 


i 
J 


COnr eel CRSUM IMPe 
CORRECT CRSUM IMPS 
CORRECT CKSUM IMP4 
COnmeac) CKSUP IMPS 
CORRECT CKSUM IMP6 
SEND LOCAL rIL& 
T2LETE FILS 

RENAME 

PRINT DIRECTORY 

HOW MUCH DISK SPAaCce 
LOG TLS 

TOPOLOGY 

NCC 

SOLIS 

A 

NSE 

CCINS TAS 

ARPANET GATEWAY 
TLL2 

NDS 

BLACKER FRONT END 
FLACKER TAS 

PT RS 

TESTFILE 
OCRATCEPILE 

TMP 1 

IMP 
IMP 
Tier 
IMP 
Ltr 
LOGIN TO UNIX 
PASSWORD 

CHANG? DIR CCINS 
STARTUP THE NCC 
Swor Th Sister 
BOST SUMMTARY 
BROADCAST FILs 
RELOAD 

MODEY ZERO 

MODEM ONE 

MODEM TWO 

wel Stee oli Cs 
ZSCAPE 

POLETE CEARACTER 
CANC2L 


MY (ro CAN) 
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PPHACMEM IMP2S52$ 
?BHACMEM IMP2$35$ 
7BEACMEM IMP4545 
TBHACMEM IMP&S5$ 
Cheacrer I(MPesss 
x5 

? DEL 

?REN 

Tee 

ee 

OueCC..LCGS 
CONCCZNETS 

As 

25) 

45 

eS 

Co 

8$ 

12 

sags 

14 

19 

ras 

fester l LES 
SCRALCHEILES 

1s 

ee 

ope 

4$ 

ae 

eS 

PrEaArk mh Y<cr> 
CUNIX PASSWDD 

CD COINS<cr> 

EME CIMET CHAT<cr> 
fend 

CHEPE: 

es 

PREL 

Ps) 

1 

z 

N¥TSTATS 

esc 

Ciel alee, 
Geer > 








APPENDIX ff 
VOICE OUTPUT MESSAGES 


INTERNAL 

NUMBER VOLGE MESSAGE 
G1¢1 IMP ## errors correctec. 
C321 IMP ## has been relcadecd. 
10012 bine #9 1S.up. 
1¢22 Iageore is Loopede 
1203 fine we is dcwn. 
17g1 IMP ## is down. 
12g Line ## is down. 
1421 TMP gr bad fest data checksum. 
EOO'1 tee, Ger trap condition. 
16901 IMP i crash report. 
18¢@1 fice wr is down plus side. 
1802 Line ## is down rintus side. 
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